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I, the undersigned, being an officer duly authorised in accordance with Section 74(1) and (4) 
of the Deregulation & Contracting Out Act 1994, to sign and issue certificates on behalf of the 
Comptroller-General, hereby certify that annexed hereto is a true copy of the documents as 
originally filed in connection with the patent application identified therein. 



In accordance with the Patents (Companies Re-registration) Rules 1982, if a company named 
in this certificate and any accompanying documents has re-registered under the Companies Act 
1980 with the same name as that with which it was registered immediately before re- 
registration save for the substitution as, or inclusion as, the last part of the name of the words 
"public limited company" or their equivalents in Welsh, references to the name of the company 
in this certificate and any accompanying documents shall be treated as references to the name 
with which it is so re-registered. 



In accordance with the rules, the words "public limited company" may be replaced by p. I.e., 
pic, P.L.C. or PLC. 



Re-registration under the Companies Act does not constitute a new legal entity but merely 
subjects the company to certain additional company law rules. 
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COMPOUNDS FOR TARGETING 

The present invention relates to cytotoxic compounds that have a high 
5 avidity for, and can be targeted to, selected cells. Specifically, the 

invention provides compounds comprising a cytotoxic portion having 
DNA endonucleolytic activity and a target-cell specific portion having 
specificity for human polymorphic epithelial mucin (PEM). 

10 Background 

The cell-specific targeting of compounds that are directly, or indirectly, 
cytotoxic has been proposed as a way to combat diseases such as cancer. 
Bagshawe and his co-workers have disclosed (Bagshawe (1987) Br, 7. 
15 Cancer 56, 531; Bagshawe et al (1988) Br. J. Cancer 58, 700; WO 

w 

88/07378) conjugated compounds comprising an antibody or part thereof 
and an enzyme, the antibody being specific to tumour cell antigens and the 
enzyme acting to convert an innocuous pro-drug into a cytotoxic 
compound. The cytotoxic compounds were alkylating agents, e.g. a 
20 benzoic acid mustard released from /wzra-N-bis(2- 
chloroethyl)aminobenzoyl glutamic acid by the action of Pseudomonas sp. 

CPG2 enzyme. 

An alternative system using different pro-drugs has been disclosed 
25 (WO 91/11201) by Epenetos and co-workers. The cytotoxic compounds 
were cyanogenic monosaccharides or disaccharides, such as the plant 
compound amygdalin, which release cyanide upon the action of a J3- 
glucosidase and hydroxynitrile lyase. 
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In a further alternative system, the use of antibody-enzyme conjugates 
containing the enzyme alkaline phosphatase in conjunction with the pro- 
drug etoposide 4 , -phosphate or 7-(2 ? -aminoethyl phosphate)mitomycin or 
5 a combination thereof have been disclosed (EP 0 302 473; Senter et al 
(1988) Proc. Natl Acad. Set USA 85, 4842). 

Rybak and co-workers have disclosed (Rybak et al (1991) /. Biol Chem. 
266, 21202; WO 91/16069) the cytotoxic potential of a monomeric 
10 pancreatic ribomiclease when injected directly into Xenopus oocytes and 
the cytotoxic potential of monomeric RNase coupled to human transferrin 
or antibodies directed against the transferrin receptor. The monomeric 
RNase hybrid proteins were cytotoxic to human erythroleukaemia cells in 
vitro. 

15 

Other approaches are the in vivo application of streptavidin conjugated 
antibodies followed, after an appropriate period, by radioactive biotin 
(Hnatowich et al (1988) /. Nucl Med. 29, 1428-1434), or injection of a 
biotinylated mAb followed by radioactive streptavidin (Paganelli et al 
20 (1990) Int. J. Cancer AS, 1184-1189). A pilot radioimmunolocalisation 
study in non-small cell lung carcinomas was conducted with encouraging 
results (Kalofonos et al (1990) /. Nucl Med. 31, 1791-1796). 

Apart from these examples, it is rather more common to see biotinylated 

25 antibodies and streptavidin-enzyme conjugates, which are used in enzyme- 
linked immunosorbent assays. 

These previous systems have used relatively large antibody-enzyme, 
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antibody-streptavidin or antibody-biotin conjugates and may comprise 
portions of non-mammalian origin which are highly immunoreactive. 

We have now devised improved compounds for targeting cells to be 
5 destroyed. 

Summary of Invention 

A first aspect of the invention provides a compound comprising a target 
10 cell-specific portion and a cytotoxic portion characterised in that die target 
cell-specific portion comprises an humanised monoclonal antibody having 
specificity for polymorphic epithelial mucin (PEM), or an aniigen binding 
fragment thereof, and the cytotoxic portion has endonucleolytic activity. 

15 By "target cell specific" portion we mean the portion of the compound 
which comprises one or more binding sites which recognise and bind to 

9 

polymorphic epithelial mucin (PEM) on the target cell. Upon contact with 
the target cell, ,the target cell specific portion is preferably internalised 
along with the cytotoxic portion. Such internalisation results in the 
20 cytotoxic portion being delivered to the cell cytosol, where it has access 
to the cell's nucleic acid molecules. 

The target cell-specific portion of the compounds of the invention 
comprises an humanised monoclonal antibody having specificity for 
25 polymorphic epithelial mucin (PEM), or an antigen binding fragment 
thereof. 

Polymorphic epithelial mucin, or PEM, is a component of the human milk 
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fai globule. PEM is expressed by cells ia several body tissues and is also 
found in urine. Significantly, PEM is known to be expressed in epithelial 
cancer cells, notably in ovarian, gastric, colorectal and pancreatic cancer 
cells. 

5 

Monoclonal antibodies which will bind to PEM are already known, but in 
any case, with today's techniques in relation to monoclonal antibody 
technology, antibodies can be prepared to most antigens. The antigen- 
specific portion may be a whole antibody, a part of an antibody (for 

10 example a Fab or F(ab'>2 fragment), a synthetic antibody fragment (for 
example a single chain Fv fragment [ScFv]), or a pqmde/peptidomimetic 
or similar. Suitable monoclonal antibodies to selected antigens may be 
prepared by known techniques, for example those disclosed in 
'Monoclonal Antibodies: A manual of Techniques " H Zola (CRC Press, 

15 1988) and in Monoclonal Hybridoma Antibodies: Techniques and 
Applications", J G R Hurrell (CRC Press, 1982) and Antibody 
Engineering, A Practical Approach, McCafferty, J. et al, ed. (IRL Pres, 
1996). 

20 By 'humanised monoclonal antibody* we include monoclonal antibodies 
having at least one chain wherein the framework regions are 
predominantly derived from a first, acceptor monoclonal antibody of 
human origin and at least one complimentarity^etennining region (CDR) 

is derived from a second, donor monoclonal antibody having specificity 
25 for PEM. The donor monoclonal antibody may be of human or non- 
human origin, for example it may be a murine monoclonal antibody. 

Preferably, both chains of the humanised monoclonal antibody comprise 
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CDRs grafted from a donor monoclonal antibody having specificity for 
PEM. 

Advantageously, the CDR-grafted (i.e. humanised) chain comprises two 
5 or all three CDRs derived from a donor antibody having specificity for 
PEM. 

Conveniently, the humanised monoclonal antibody comprises only human 
framework residues and CDRs from a donor antibody having specificity 
10 for PEM. 

However, it will be appreciated by those skilled in the art that in order to 
maintain and optimise the specificity of the humanised antibody it may be 
necessary to alter one or more residues in the framework regions such that 
15 they correspond to equivalent residues in the donor antibody. 

Conveniently, the framework regions of the humanised antibody are 
derived from an human IgG monoclonal antibody. 

20 Methods of making humanised monoclonal antibodies are well-known in 
the art, for example see Jones et al (1986) Nature 321:522-525, 
Riechmann et al (1988) Nature 332:323-327, Verhoeyen et al (1988) 
Science 239: 1534-1536 and EP 239 400 (to Winter). 

25 In a preferred embodiment of the first aspect of the invention, the target 
cell-specific portion comprises an humanised HMFG-1 monoclonal 
antibody or an antigen binding fragment thereof. 
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HMFG antibodies are raised against human milk fat globule (HMFG), in 
a delipidated state (see Taylor-Papadimiriou et aL, 1981, Int. J. Cancer 
28:17-21 and Gendler et al^ 1988, J. Biol Chem. 236:1282-12823). 
HMFG-1 monoclonal antibodies bind to a particular component of 
5 HMFG, namely polymorphic epithelial mucin (PEM). Binding is thought 
to involve the amino acid sequence APDTR within the twenty amino acid 
tandem repeats of the mz/c-i gene product. 

Examplary humanised HMFG-1 antibodies are disclosed in WO 92/04380. 

10 

Advantageously, the target cell-specific portion is an humanised HMFG-1 
monoclonal antibody. 

In a preferred embodiment of the first aspect of the invention, the target 
15 cell -specific portion comprises a fragment of an humanised monoclonal 
antibody having specificity for polymorphic epithelial mucin (PEM), said 
fragment retaining the antigen binding properties of the parent antibody. 

The variable heavy (Vh) and variable light (VjJ domains of the antibody 
20 are involved in antigen recognition, a fact first recognised by early 
protease digestion experiments. Further confirmation was found by 
"humanisation* of rodent antibodies. Variable domains of rodent origin 
may be fused to constant domains of human origin such that the resultant 
antibody retains the antigenic specificity of the rodent parented antibody 
25 (Morrison et al (1984) Proc. Natl Acad. Sci. USA 81, 6851-6855). 

That antigenic specificity is conferred by variable domains and is 
independent of the constant domains is known from experiments involving 



Received 03-Apr-DD 15:24 



From-DMS9 S52201 



To-THE PATENT OFFICE Page 11 



03/04 '00 MON 14:25 FAX 01159 552201 



ERIC POTTER CLARK SON 



Eg] 012 



7 

the bacterial expression of antibody fragments, all containing one or more 
variable domains. These molecules include Fab-like molecules (Better et 
al (1988) Science 240, 1041); Fv molecules (Skerra et al (1988) Science 
240, 1038); disulphide-linked Fv molecules (Young et aL, 1995, FEBS 
5 Lett. 377:135-139); single-chain Fv (ScFv) molecules where the V H and 
V L partner domains are linked via a flexible oligopeptide (Bird et al (1988) 
Science 242, 423; Huston et al (1988) Proc. Natl Acad. ScL USA 85, 
5879) and single domain antibodies (dAbs) comprising isolated V domains 
(Ward et al (1989) Nature 341, 544). A general review of the techniques 
10 involved in the synthesis of antibody fragments which retain their specific 
binding sites is to be found in Winter & Milstein (1991) Nature 349, 293- 
299. 

By u ScFv molecules" we mean molecules wherein the V H and V L partner 
15 domains are linked via a flexible oligopeptide. 

Chimaeric antibodies are discussed by Neuberger et al (1988, 8th 
International Biotechnology Symposium Part 2, 792-799). 

20 The advantages of using antibody fragments, rather than whole antibodies, 
are several-fold. The smaller size of the fragments allows for rapid 
clearance, and may lead to improved tumour to non-tumour ratios. Fab, 
Fv, ScFv, disulphide Fv and dAb antibody fragments can all be expressed 
in and secreted from bacteria, such as E. coli, or eukaryotic expression 

25 systems such as Yeast or mammalian systems, thus allowing the facDe 
production of large amounts of the said fragments. 

Whole antibodies, and F(ab') 2 fragments are "bivalent". By "bivalent" we 
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mean that the said antibodies and F(ab') 2 fragments have two antigen 
combining sites. In contrast, Fab, Fv, ScFv, disulphide Fv and dAb 
fragments are monovalent, having only one antigen combining site. 

5 Preferably, the target cell-specific portion of the compounds of the 
invention comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, such 
as Fab and F(ab')2» Fv molecules, disulphide-linked Fv molecules, ScFv 
molecules qtiH single domain antibodies (dAbs). 

10 

More preferably, the target cell-specific portion comprises a Fab molecule 
or a F(ab') 2 molecule. 

Yet more preferably, the target cell-specific portion comprises at least a 

15 part of one or both of the amino acid sequences shown in Figure 3. 

Most preferably, the target cell-specific portion comprises both of the 
amino acid sequences shown in Figure 3 . 

20 Preferably, the target cell-specific portion recognises the target cell with 
high avidity. 

By "high avidity" we mean that the target cell-specific portion recognises 
the target cell with a binding constant of at least = 10" 6 M, preferably 
25 at least K* = 10* M, suitably K„ = 10- 10 M S more suitably = 10" 11 M, 
yet more suitably still = 10 U M, and more preferably = 10~ l5 M or 
evenK* = 1Q- 18 M. 
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Preferably, the target cell-specific portion comprises an antigen binding 
fragment of an humanised HMFG-1 monoclonal antibody, e.g. an Fab or 
F(ab , )2 fragment thereof, wherein a hinge region contains a mutation 
(i.e. wherein the hinge is a variant or hybrid of a naturally occurring 
5 hinge). More preferably, the variant hinge comprises the amino acid 
sequence CCVECPPCPAPE. 

By 'cytotoxic portion' we mean a portion having endonucleolytic activity 

which is toxic to the cell if it is to reach, and preferably enter said cell. 

10 

In a preferred embodiment of the first aspect of the invention, the 
cytotoxic portion has DNA endonucleolytic activity. 

Advantageously, the cytotoxic portion is at least the catalytically active 
15 portion of a DNA endonuclease. 

Examples of known DNA endonucleases include bovine DNase I (see 
Worrall and Conolly, 1990, J. Biol. Chem. 265:21889-21895). Human 
pancreatic DNase I has also been cloned (see Shale et aL , 1990, Proc. 
20 Natl Acad. Set USA 87:9188-9192 and Hubbard et al 7 1992, New Eng. 
J. Med. 326:812-815). 

Preferably, the endonuclease is a mammalian deoxyribonuclease I. 

25 More preferably, the endonuclease is a human deoxyribonuclease L 

Most preferably, the cytotoxic portion comprises the amino acid sequence 
shown in Figure 2. 
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Preferably, the cytotoxic portion of the compound of the invention is 
capable of oligomerisation, e.g. dimerisation. Attachment of the target- 
cell specific portion to a cytotoxic portion capable of oligomerisation 
5 provides a method for increasing the number of binding sites to the target 
cell. For example, if the target cell-specific portion is joined to a portion 
capable of forming a dimer then the number of target cell-specific binding 
sites is two; if the target cell-specific portion is joined to a portion capable 
of forming a tetramer then the number of target cell-specific binding sites 
10 is four. The number of target cell-specific binding sites is greater than 
one and the compounds may therefore have a greater avidity for the target 
cell than do compounds which only have one target cell-specific binding 
site. 

15 It is preferable for the cytotoxic portion of the compound of the invention 
capable of oligomerisation to contain no interchain disulphide bonds nor 
intrachain disulphide bonds; to be well characterised; to be non-toxic; to 
be stable; to be amenable to preparation in a form suitable for pre-clinical 
or clinical use or be in pre-clinical or clinical use; and for the subunit 

20 monomers to have a high affinity for each other, that is they contain one 
or more subunit binding sites. 

Advantageously, the cytotoxic portion is of mammalian, preferably 
human, origin. The use of the said mammalian proteins as the cytotoxic 
25 portion of the compound of the invention is advantageous smce such s 
compounds are less likely to give rise to undesirable immune reactions. ^ 

* 

It will be appreciated by those skilled in the art that the cytotoxic portion 

^ 
1 

t 

Received 03-Apr-00 15:24 From-01159 552201 To-TME PATENT OFFICE 1= 



03/04 '00 MON 14:26 FAX 01159 552201 



ERIC POTTER CLARK SON 



©016 



11 

may be a variant of a naturally occurring endomiclease. 

By w a variant* we include cytotoxic portions comprising of a naturally 
occurring endomiclease wherein there have been amino acid insertions, 
5 deletions or substitutions, either conservative or non-conservative, such 
that the changes do not substantially reduce the endomiclease activity of 
the variant compared to that of the naturally occurring endomiclease. For 
example, the variant may have increased activity compared to the naturally 
occurring endomiclease 

10 

Such variants may be made using methods of protein engineering and site- 
directed mutagenesis commonly known in the art (for example, see 
Sambrook et aL, 1989, Molecular cloning: A Laboratory Manual. 2 nd 
edition, Cold Spring Harbor Laboratory Press, NY, USA). 

In an alternative embodiment, the endomiclease is a restriction 
endomiclease, such as a microbial type II restriction endomiclease. 
Exemplary type II restriction endonucleases include BamHl, HindUl, 
Msph Sau3Al, Hinfl, NotI and EcdKL. 

20 

In another preferred embodiment of the first aspect of the invention, a 
nuclear localization signal is incorporated into the compound. 

Preferably, the nuclear localization signal comprises a nuclear localization 
25 signal from the SV40 large T antigen (Kalderon et al , 1984. Cell 39:499- 
509), and specifically the amino acid sequence PKKKRKV. Inclusion of 
a nuclear localization signal encourages the compound of the invention to 
gain access to the chromosomal DNA during the periods of the cell sycle 
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when the nuclear membrane is intact, since the nuclear pores are 
permeable to large molecules incorporating said nuclear localization 

signal. 

5 In a further preferred embodiment of the first aspect of the invention, the 
target cell-specific portion and the cytotoxic portion are fused to create a 
fusion compound. 

By "fusion compound" we include a compound comprising one or more 
10 functionally distinct portions, wherein the distinct portions are contained 
within a single polypeptide chain produced by recombinant DNA 
techniques. 

Preferably, the target-cell specific and the cytotoxic portion of the fusion 
15 compound of the invention separated by a linker sequence, for example to 
allow greater flexibility of the portions relative to one another. 

More preferably, the linker sequence comprises a GG dipeptide. 

20 Most preferably the linker sequence is or comprises GG or GSGG. 

Alternatively, the target-cell specific and the cytotoxic portion of the 
compound of the invention are separate moieties linked together by any of 
the conventional ways of cross-linking polypeptides, such as those 
25 generally described in O' Sullivan etal Anal Biochem. (1979) 100, 100- 
108. For example, the antibody portion may be enriched with thiol 
groups and the enzyme portion reacted with a bifunctional agent capable 
of reacting with those thiol groups, for example the N-hydroxysuccinimide 
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ester of iodoacetic acid (NHIA) or N-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP). Amide and thioether bonds, for 

example achieved with m-maleimidobepzoyl-N-hydroxysuc^irjmide ester, 
are generally more stable in vivo than disulphide bonds. 

5 

A second aspect of the invention provides a nucleic acid encoding a 
compound according to the first aspect of the invention, or a target cell- 
specific portion or cytotoxic portion thereof. 

10 By "nucleic acid molecule" we include DNA, cDNA and mRNA 
molecules. 

A further aspect of the present invention provides a method of making a 
compound according to the first aspect of the invention, said method 
15 comprising expressing one or more nucleic acid molecules according to 
the second aspect of the invention in a host cell and isolating the 
compound therefrom. 

It is preferable that the two portions of the compound of the invention are 
20 produced as a fusion compound by recombinant DNA techniques, 
whereby a length of DNA comprises respective regions encoding the two 
portions of the compound of the invention either adjacent one another or 
separated by a region encoding a linker peptide which does not destroy the 
desired properties of the compound. The benefits in making the 
25 compound of the invention using recombinant DNA techniques are several 

fold. Firstly, it enables a high degree of precision with which the two 

portions of the compound can be joined together. Secondly, the 
construction of compounds which are "hetero-oligomeric" can be 
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controlled by the expression of the different recombinant DNA molecules 
encoding each of the different type of subunit of the "hetero-oligomer" in 
the same host cell. 



5 By "hetero-oligomer 11 we mean those compounds in which two or more 
different cell-specific portions are joined to either the same or to different 

subunits which are capable of oligomerisation. The expression, in die 

same host cell of two compounds, of A and B, each with different target 
cell specific portions but with a common second portion capable of 

10 oligomerisation will result in a mixed population of compounds. For 

example, if the common second portion is capable of dimerisation, three 
potential compounds will be produced: A 2 , AB and R,, in a ratio of 1:2:1, 
respectively. 



15 The separation of the desired compound with each of the different cell 
specific portions, that is AB, can be achieved by two step affinity 
chromatography. 



Application of the mixture of compounds to an affinity column specific for 
20 A will result in the binding of Aa and AB, These compounds are eluted 
from this first column, and then applied to an affinity column specific for 
B. This will result in AB, but not A 2 , being bound to the column. 
Finally, the desired product AB, can be eluted. 



25 Of course, the order in which the affinity columns are used is not 

important. 

The same principle of separating those compounds with two or more 
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different binding sites can be applied to the purification of the desired 
compounds from mixtures of other hetero-oligomers. 

Conceivably, the two portions of the compound may overlap wholly or 
5 partly. 

The nucleic acid is then expressed in a suitable host to produce a 
polypeptide comprising the compound of the invention. Thus, the nucleic 
acid encoding the compound of the invention or a portion thereof may be 
10 used in accordance; with known techniques, appropriately modified in view 
of the teachings contained herein, to construct an expression vector, which 
is then used to transform an appropriate host cell for the expression and 
production of the polypeptide of the invention. Such techniques include 

those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 to Rutter 
15 etaU 4,530,901 issued 23 July 1985 to Weissman, 4,582,800 issued 15 
April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark et al 7 
4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 November 
1987 to Itakura et al, 4,710,463 issued 1 December 1987 to Murray, 
4,757,006 issued 12 July 1988 to Toole, Jr. et al % 4,766,075 issued 23 
20 August 1988 to Goeddel et al and 4,810,648 issued 7 March 1989 to 
Stalker, all of which are incorporated herein by reference. 

The nucleic acid encoding the compound of the invention or a portion 
thereof may be joined to a wide variety of other nucleic acid sequences for 
25 introduction into an appropriate host. The companion nucleic acid will 
depend upon the nature of the host, the manner of the introduction of the 
nucleic acid into the host, and whether episomal maintenance or 
integration is desired. 
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It will be appreciated that in order to prevent expression of the cytotoxic 
portion of the compound of the invention from killing the host cells in 
which it is expressed, it may be necessary to link the nucleic acid of the 
5 second aspect of the invention to a signal sequence capable of directing 
secretion of the expressed compound (or portion) out of the host cell. 
Signal sequences will be selected according to the type of host cell used. 
Exemplary signal sequences include the ompA signal sequence (for 
example, see Takahara et al. 1985. /. Biol Chem. 260(5) :2670-2674). 

10 

Generally, the nucleic acid is inserted into an expression vector, such as 
a plasmid. in proper orientation and correct reading frame for expression. 
If necessary, the nucleic acid may be linked to the appropriate 
transcriptional and rranslational regulatory control nucleotide sequences 

15 recognised by the desired host, although such controls are generally 
available in the expression vector. The vector is then introduced into the 
host through standard techniques. Generally, not all of the hosts will be 
transformed by the vector. Therefore, it will be necessary to select for 
transformed host cells. One selection technique involves incorporating 

20 into the expression vector a nucleic acid sequence, with any necessary 
control elements, that codes for a selectable trait in the transformed cell, 
such as antibiotic resistance. Alternatively, the gene for such selectable 
trait can be on another vector, which is used to co-transform the desired 
host cell. 

25 

Host cells that have been transformed by the recombinant nucleic acid of 
the invention are then cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
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disclosed herein to permit the expression of the polypeptide, which can 
then be recovered. 



Many expression systems are known, including bacteria (for example E. 
5 coli and Bacillus subtilis), yeasts (for example Saccharomyces cerevisiae 
and Pichiapastoris), filamentous fungi (for example Aspergillus), plant 
cells, animal cells (for example COS-1. COS-7, CHO, NIH 3T3, NS0 and 
BHK cells) and insect cells (for example Drosophila, SF9 cells). 

10 Those vectors that include a replicon such as a procaryotic replicon can 
also include an appropriate promoter such as a procaryotic promoter 
capable of directing the expression (transcription and translation) of the 
genes in a bacterial host cell, such as E. coli, transformed therewith. 



15 A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are 
typically provided in plasmid vectors OTntaining convenient restriction 
sites for insertion of a DNA segment of the present invention. 

20 

Typical procaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 (available from Biorad Laboratories, Richmond, CA, USA), 
p7rc99A and pKK223-3 (available from Pharmacia Piscataway, NJ, USA) 
and the pET system (T7 promoter, Novagen Ltd). 

25 , 

>. 

A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia. Piscataway, NJ, USA. This vector uses the SV40 late 
promoter to drive expression of cloned genes, the highest level of 
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expression being found in T antigen-producing cells, such as COS-1 cells. 

An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
5 promoter of the mouse mammary tumour virus long terminal repeat to 
drive expression of the cloned gene. 

Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
10 USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating plasmids (Yips) and incorporate the yeast selectable markers 
his3. trpl, leu2 and ura3. Plasmids pRS413-416 are Yeast Centromere 
plasmids (YCps). 

15 Further useful vectors for transformation of yeast cells, such as Pichia, 
include the 2fi plasmid p YX243 (available from R and D Systems Limited) 
and the integrating vector pPICZ series (available from Invitrogen). 

A variety of methods have been developed to operatively link DNA to 
20 vectors via complementary cohesive tenrrini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. 

25 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
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earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
polymerase I, enzymes that remove protruding, 3' -single-stranded termini 
with their 3'-5'-exonucleolytic activities, and fill in recessed 3*-ends with 
their polymerizing activities. 

5 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
10 bacteriophage T4 DNA ligase. Thus, the products of the reaction are 
DNA segments carrying polymeric linker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 

15 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

20 A desirable way to modify the nucleic acid encoding the compound of the 
invention or a portion thereof is to use the polymerase chain reaction as 
disclosed by Saiki et al (1988) Science 239, 487-491 . 

In this method the nucleic acid to be enzymatically amplified is flanked by 
25 two specific oligonucleotide primers which themselves become 
incorporated into the amplified nucleic acid. The said specific primers 
may contain restriction endonuclease recognition sites which can be used 
for cloning into expression vectors using methods known in the art. 
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Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, Saccharomyces, Kluyveromyces, Candida, 
Torulopsis, Hansenula, Schizosaccharomyces, Citeromyces, Pachysolen, 
5 Debaromyces, Metschumkoma, Khodosporidium, Leucosporidiwn, 
Botryoascus, Sporidiobolus, Endomycopsis , and the like. Preferred 
genera are those selected from the group consisting of Pichia, 
Saccharomyces, Kluyveromyces, Yarrawia and Hansenula. Examples of 
Saccharomyces are Saccharomyces cerevisiae, Saccharomyces italicus and 
10 Saccharomyces rovxii. Examples of Kluyveromyces are KLuyveromyces 
fragilis and Kluyveromyces lactis. Examples of Hansenula are Hansenula 
polymorphs. Hansenula anomala and Hansenula capsulata. Yarrowia 
lipolytica is an example of a suitable Yarrowia species. 

15 Methods for the transformation of 5. cerevisiae are taught generally in EP 
251 744, EP 258 067 and WO 90/01063, all of which are incorporated 
herein by reference. 

Suitable promoters for 5. cerevisiae include those associated with the 
20 PGK1 gene, GAU or GAL10 genes, CYC1, PHOS 9 TRP1 7 ADHL ADH2, 
the genes for glyceraldehyde-3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, triose phosphate isomerase, 
phosphoglucose isomerase, glucokinase, a-mating factor pheromone, a- 
mating factor pheromone, die PRB1 promoter, the GUT2 promoter, and 
25 hybrid promoters involving hybrids of parts of 5 1 regulatory regions with 

parts of 5' regulatory regions of other promoters or with upstream 
acti vation sites (eg the promoter of EP-A-258 067). 
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The transcription termination signal is preferably the 3' flanking sequence 
of a eukaryotic gene which contains proper signals for transcription 
termination and polyadenylation. Suitable 3 1 flanking sequences may, for 
example, be those of the gene naturally linked to the expression control 
5 sequence used, i.e. may correspond to the promoter. Alternatively, they 
may be different in which case the termination signal of the 5, cerevisiae 
AHD1 gene is preferred. 

The present invention also relates to a host cell transformed with a 
10 polynucleotide vector construct of the present invention. The host cell can 
be either procaryotic or eukaryotic. Bacterial cells are preferred 
procaryotic host cells and typically are a strain of E. coli such as 7 for 
example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 
15 American Type Culture Collection (ATCC) of Rockville. MD, USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and 
mammalian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic cell line. Preferred eukaryotic host 
cells include Chinese hamster ovary (CHO) cells available from the ATCC 
20 as CCL61, NIH Swiss mouse embryo cells NIH/3T3 available from the 
ATCC as CRL 1658 and monkey kidney-derived COS-1 cells available 
from the ATCC as CRL 1650 or WS0 cells. 

Transformation of appropriate cell hosts with a nucleic acid constructs of 
25 die present invention is accomplished by well known methods that 
typically depend on the type of vector used. With regard to 
transformation of procaryotic host cells, see, for example, Cohen et aL 
Proc. Natl Acad. Sci. USA, 69: 2110 (1972); and Sambrook et al, 
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Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989). Transformation of yeast 
cells is described in Sherman et al, Methods In Yeast Genetics, A 
Laboratory Manual, Cold Spring Harbor, NY (1986). The method of 
5 Beggs. Nature, 275: 104-109 (1978) is also useful. With regard to 
vertebrate cells, reagents useful in transfecting such cells, for example 
calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Lie, 
Gaithersburg, MD 20877, USA. 

Successfully transformed cells, Le. cells that contain a nucleic acid 
construct of the present invention, can be identified by well known 
techniques. For example, cells resulting from the introduction of an 
expression construct of the present invention can be grown to produce the 
15 polypeptide of the invention. Cells can be harvested and lysed and their 
DNA content examined for the presence of the DNA using a method such 
as that described by Southern. J. Mol Biol , 98: 503 (1975) or Berent et 
al, Biotech., 3: 208 (1985). Alternatively, the presence of the protein in 
the supernatant can be detected using antibodies as described below. 

20 

In addition to directly assaying for the presence of recombinant nucleic 
acid, successful transformation can be confirmed by well known 
immunological methods when the recombinant nucleic acid is capable of 
directing the expression of the protein. For example, cells successfully 
25 transformed with an expression vector produce proteins displaying & 
appropriate antigenicity. Samples of cells suspected of being transformed 
are harvested and assayed for the protein using suitable antibodies. 



. - 
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Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a 
monoclonal (clonally homogeneous) culture, or a culture derived from a 
monoclonal culture, in a nutrient medium. Preferably, the culture also 
5 contains the protein. 

Nutrient media useful for culturing transformed host cells are well known 
in the art and can be obtained from several commercial sources. 

10 A third aspect of the invention provides a vector comprising a nucleic acid 
according to the second aspect of the invention. 

A fourth aspect of the invention provides a host cell comprising a vector 
according to the third aspect of the invention. 

Preferably, the host cell is a mammalian cell. 

More preferably the host cell is NS0. 

20 A fifth aspect of the invention provides a pharmaceutical composition 
comprising a compound according to the first aspect of the invention and 
a pharmaceutical^ acceptable carrier. 

The compounds and compositions of the invention are administered in any 
25 suitable way, usually parenterally, for example intravenously, 
intraperitoneally or, preferably (for bladder cancer), intravesically 
0*,£. into the bladder), in standard sterile, non-pyrogenic formulations of 
diluents and carriers, for example isotonic saline (when administered 
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intravenously). 

A sixth aspect of the invention provides a compound according to the first 
aspect of the invention for use in medicine. 

5 

The compounds and compositions of the invention may be used to treat a 
patient with any disease involving a dysfunction of a population of cells 
expressing PEM, said compounds and compositions selectively targeting 
and destroying said population of cells within a patient. For example, said 

10 compounds and compositions may be used in the treatment of cancer, e.g. 
cancer of the breast, ovaries, lung, stomach, intestines, blood etc. Thus, 
anti-tumour cell antigen antibodies can be used to deliver a cytotoxic 
portion with endonuclease activity to a tumour cell. Antibodies that are 
internalised upon contact with the target antigen are used, such that the 

15 cytotoxic portion enters the cytosol of the tumour cell, where it can trigger 
cell death. 

In principle, the compounds and compositions of the invention may be 
used to treat any mammal, including pets such as dogs and cats and 
20 agriculturally important animals such as cows, horses, sheep and pigs. 

Preferably, the patient is human. 

A seventh aspect of the invention provides the use of a compound 
25 according to first aspect of the invention in the preparation of a 
medicament for treating a mammal having said target cells to be 
destroyed. 
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Preferably, the medicament is for treating cancer, such as ovarian cancer. 

A eighth aspect of the invention provides a method of treating a mammal 
having target cells to be destroyed, the method comprising administering 
5 a compound according to the first aspect of the invention to said mammal. 

In a preferred embodiment of the seventh and eighth aspects of the 
invention, the mammal is a human. 

10 Preferably, die target cells to be destroyed are cancer cells. More 
preferably, die cancer cells are epithelial cancer cells, such as ovarian, 
gastric, colorectal and/or pancreatic cancer cells. Most preferably, the 
cancer cells are ovarian cancer cells. 

15 The invention will now be described in detail with reference to the 
following figures and examples: 

Figure 1 shows the complete coding sequence of human DNAse I. 

20 Figure 2 shows the DNAse I sequence used in the exemplary constructs 
(e.g. pAS41). 

Figure 3 shows the HMFG1 light chain insert (A) and heavy chain insert 
(B) used in the exemplary constructs (e.g. pAS6). 

25 

Figure 4 shows die linker and hinge-linker oligonucleotides used in (A) 
the whole antibody-DNase and (B) the Fab'2-DNase exemplary 
constructs. Note, in Figure 4(A) a deletion of one or more codons 



Received 03-Apr-QQ 15:24 



From-01 1 59 552201 



To-THE PATENT OFFICE 



Page 30 



03/04 '00 MON 14:30 FAX 01159 552201 



ERIC POTTER CLARKSON 



1)031 



26 



between the HMFG1 hinge and the linker is represented as aG. 



Figure 5 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS23 Gihker sequence underlined). 



Figure 6 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS27 Oinker sequence and NLS 
underlined). 



10 Figure 7 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS34 (linker sequence underlined). 



Figure 8 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS35 (linker sequence underlined). 
15 The lower case 4 g' represents a silent mutation caused by PCR 
amplification. 



Figure 9 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS36 (linker sequence underlined). 
20 The lower case 'c' represents a silent mutation caused by PCR 
amplification. 



Figure 10 shows (A) the nucleotide sequence and (B) die translated amino 
acid sequence of exemplary construct pAS37 (linker sequence and NLS 
25 underlined). 



Figure 1 1 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS38 (linker sequence and NLS 



s 
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underlined). The lower case 'g 1 represents a silent mutation caused by 
PCR amplification. 

Figure 12 shows (A) the nucleotide sequence and (B) the translated amino 
5 acid sequence of exemplary construct pAS39 (linker sequence underlined). 
The lower case *c' represents a silent mutation caused by PCR 
amplification. 

Figure 13 shows (A) the nucleotide sequence and (B) the translated amino 
10 acid sequence of exemplary construct pASlOl (linker sequence 
underlined). 

Figure 14 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct p AS 102 (hybrid hinge and linker 
15 sequence underlined) . 

Figure 15 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS103 (hybrid hinge and linker 
sequence underlined). 

20 

Figure 16 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct p AS 104 (hybrid hinge and linker 
sequence underlined, mutations (compared to p AS 103) at positions 775 
and 924 are shaded). 

25 

Figure 17 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS105 (linker sequence and NLS 
underlined). 
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Figure 18 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct p AS 106 (hybrid hinge + linker 
sequence and NLS are underlined). 

5 

Figure 19 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of exemplary construct pAS107 (hybrid hinge + linker 
sequence and NLS are underlined). 

10 Figure 20 shows a schematic diagram of the pEE6 expression vector used 
in the exemplary constructs. . 

Figure 21 shows autoradiography from immimo-precipitation experiments 
with metabolically labelled transient transfectants: 

15 

GEL A 

Lane 1 shows the precipitation of supernatant from mock-transfected 
cells. 

Lane 2 is from cells transfected with hHMFG-1 (construct 6) giviug 
20 expected molecular weights of about 51.2 and 26.4 kDa for the 

heavy and light chains, respectively. 

Lane 3 shows construct 34 antibody construct which has human 
DNase I fused to the C-tenninus of the heavy chain gene. As 
expected, the size of the heavy chain gene has increased to about 
25 80.7 kDa. 

Samples from whole antibody DNase I constructs 35, 36 and 39 
were run on the gel (Lanes 4 to 6) but were not sufficiently well 
expressed to be visible, in this experiment. 
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In subsequent experiments using this method, construct 39 was 
detectable but weak, and constructs 35 and 36 were detectable but 
very weak. Constructs 37 and 38 have not been tested in this assay 
system. 

5 Lanes 8 to 10 are fusion of humanised HMFG1 F(ab') 2 with human 

DNase I (constructs 41, 23 and 102, respectively). F(ab') 2 alone was 
included in this set of experiments (lane 7, construct 41) but did not 
express, this was included in later experiments (see gels C and D). 
In addition to the light chain (about 26.4 kDa) and the Fd-DNAse 

10 I fusion (about 56.6 kDa), a third major band is observed at around 

40 kDa. Interestingly, this band is observed in the humanised 
HMFG-1 fusions but not in the antibody alone. Since an anti-Ffab'^ 
antibody was used for immuno-precipitation, it is unlikely that this 
can be proteolysis between immunoglobulin and DNase I sequence. 

15 It probably represents a population of polypeptide produced by 

premature transcriptional termination (due to DNase I sequence in 
the 3 '-end of the fusion mRNA) , 



GEL B 

20 This is the non-reducing gel counterpart to gel A, described above. 

Lane 1 is the mock-transfected control cells and lanes 2 and 3 are 
from the cells transfected with humanised HMFG1 alone (construct 
6) and the humanised HMFG-1 fused at the C-terminus to human 
DNase I, respectively. As before, lanes 4 to 6 are from cell 

25 supernatants from cells transfected with constructs 35. 36 and 39. 

The gel shows that both the whole antibody and the antibody-DNase 
I fusion are assembled, with the DNase fusion giving a higher 
molecular weight compared to the antibody alone. 
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Figure 22 shows a typical standard curve used to determine the 
concentration of PDTRP-binding material in the supernatants of transiently 
transfected L761h cells. Each point on the curve has been determined 
5 twice. 

Figure 23 shows typical standard curves used to determine the 
concentration of bovine DNAse L 

10 Figure 24 shows corrected DNase I activity in transiently expressed 
humanised HMFG1 whole antibody-human DNAse I fusions (Le. pAS34, 
pAS34, pAS35 and pAS6(control]). 

Figure 25 shows the corrected DNAse I activity in transiently expressed 
15 humanised HMFG1 F(abVbuman. DNase I fusions (i.e. pASlOl, 
pAS102, pAS103 and pAS41 [control]). 

Figure 26 shows results of the cytotoxicity assay. 

20 Figure 27 shows the % of MCF7 cells killed afetr incubation with the 
exemplary constructs. 
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EXAMPLES 

(A) Mammalian expression of humanised HMFGl-DNase constructs 

5 The human HMFG1 light and heavy chain (with or without engineering 
a fusion to human DNase I), were cloned into the pEE6 expression vector 
system for expression in mammalian CHO or myeloid NS0 cells (see 
figure 20). The vector system was originally developed by Celltech Ltd 
(UK) and is now owned by al-Lonza (see Young & Owens, 1994, /. 

10 Immunol Meth. 168:149-165). The vector consists of two human 
cytomegalovirus promoters (hCMV) for both the heavy and light chain 
genes. Each transcription unit is completed by the poly-adenylation signal 
(pA) with an optional immunoglobulin terminator sequence (Ig term.) 
located between the heavy and light chain transcription units. Propagation 

15 in E.coli can be selected for by the presence on an ampicillin resistance 
gene (not shown in Fig 20). The inclusion of a glutamine synthetase gene 
(GS) in the vector allows the stable NS0 transfectomas to be selected by 
growth in glutamine free media, since NS0 cells are GS0 and cannot 
otherwise grow in glutamine free media. 

20 

Exemplary humanized HMFGl-DNAse I fusion constructs of the 
invention are detailed in figures 5 to 19. 

(B) Immuno-precipitation of metabolically labelled transient 
25 transfectants 

CHO-L761h cells (Cockett et aL, 1990, Nuc. Acids Res. 19:319-325) 
were transfected, according to the modification of Gorman et al, 1985), 
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with expression vectors containing either whole HMFG1 antibody or 
F(ab') 2 fragment of the antibody along with the various fusion constructs 
of their respective heavy chains and human DNase I. The cells were then 
incubated with either 50 /iCi 35 S methione for 72 h in methionine-free 
5 medium. Secreted product was precipitated with a rabbit anti-human 
F(ab') 2 antibody bound to protein A Sepharose. Bound material was 
eliited in either reducing or non-reducing SDS-PAGE loading buffer and 
run on gels. The autoradiographs (see Figure 21) above were generated 
from those gels after drying them. 

10 

(C) Estimation of the efficiency of DNase constructs in supernatants 

Introduction 

15 This set of experiments was designed to standardise the amount of 
construct in a given DNase I activity assay and to allow us to comment on 
the amount of activity a particular construct possesses. Given that the 
antibody-DNase I fusions are so different to the F(ab , ) 2 -DNase I fusions 
it is best not to compare the two groups. Once we have purified the 

20 protein, we will have a better idea of the exact molecular configuration of 
all species. Then, and only then, will it be sensible to compare amongst 
groups. 

Determination of concentration of constructs 

25 

The concentration of constructs in supernatants from transiently 
transfected L761H cells was determined in a PDTRP-binding ELISA. To 
each well of a Maxisorb 96- well ELISA plate (Nunc) was added 100 /xl of 
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carbonate buffer containing 100 ng of recombinant GST-(PDTRP) 7 fusion 
protein (Gendler et al., 1990, J. Mol Biol. 265:15286-93). After 
overnight binding at 4°C, the plate was washed three times in PBS-Tween 
(i.e. PBS containing 0.05 % Tween-20). The plate was then blocked with 
5 three 3-minute washes of PBS-Tween containing 1 % BSA. 

For each construct, 100 pi of supernatant was added to a well on the plate. 
In addition, hHMFG-1 of known concentration was serially diluted down 
the plate using doubling dilutions in 100 id of PBS-Tween per well. The 

10 plate was incubated for a further 1 h at 30°C, then 200 ng of MC135 anti- 
human kappa light chain antibody (binding site) in 100 n\ of PBS-Tween 
was added to each well for 1 h at 30°C. After three 3-minute washes in 
PBS-Tween, 100 pi of anti-mouse IgG-peroxidase conjugate (Jackson 315- 
035-045), diluted 1:2000 in PBS-Tween, was added to each well and 

15 incubated for 1 h at 30°C. Following a final set of three 3-minute washes 
in PBS-Tween, 100 /d of TMB substrate (Sigma) was added to each well 
of the plate and, after a colour developed, the optical density at 630 nm 
of the solution in each well of the plate was determined. 

20 Results 

(see Figure 22) 

(D) Corrected bovine DNase I standard curves and DNase assay 

25 

DNase activity was determined using a modification of the methyl green- 
DNA complex degradation method (Sinicropi et al., 1994, Analyt. 
Biochem. 222:351-358). Briefly, a 1:1 solution of the assay buffer and 
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methyl green-salmon sperm DNA complex was mixed together to give a 
total volume of 0.2 ml. To this, 0. 1 ml of tissue culture supernatant from 
transiently transfected CHO-L761h ceUs was added and the mixture 
incubated at 37 °C. DNA cleavage by DNase results in a reduction in 
5 absorbance at 620 nm. Figure 23 shows a standard curve produced with 
various concentrations of bovine DNase I over a number a time point. 

Figures 24 and 25 show DNAse activity for the whole HMFG1 antibody- 
and F(ab')2 - DNase fusions, respectively. 

10 

(El Cytotoxicity of DNAse constructs 
Method 

15 DNase constructs were transfected into CHO L761h cells using a calcium 
phosphate co-precipitation method (Gorman et at., 1985, In: DNA cloning 
(2nd edition), Glover A(ed.), Academic Press, NY, 163-188). Included 
in the experiment were negative controls, consisting of cells transfected 
with TE buffer alone or with TE buffer and pEE6 expression vector. In 

20 addition to these controls, vectors that express hHMFG-1 (pAS6) and 
F(ab')2 of hHMFGl (both with specificity for PEM but without DNase 1) 
were included. 

The supernatant from these cells was harvested after 72 h of expression, 
25 followed by centrifugation to remove dead cells. MCF-7 cells were 
incubated for 1 h at 37 "C with an aliquot of each of these supematants. 
The amount of cellular lactate dehydrogenase (LDH) released from the 
MCF-7 cells due to the cytotoxicity of the supernatant was determined 
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10 



using the CytoTox96 cytotoxic assay kit (Promega). Total lysis ('total 
LDH") was detennined by measuring the target cell maximum LDH 
release using the kits lysis solution. The percentage of cells killed was 
then calculated as the proportion of the LDH released to the total LDH 
released. For each construct, the cytotoxicity assay was performed in 
quadruplicate, except for assay of pAS38 and 39, which were performed 
in triplicate. The values of LDH release for each construct were 
compared against either F(ab') 2 or whole antibody, or each other, using 
a one-tailed t-test in Excel. 



Results 



killim 



pAS6 (HMFG1 alone) or with pAS41 (F(ab') 2 alone). All 01 tne 
1 5 hHMFGl F(ab') 2 -DNase I constructs kill significantly more cells than the 
F(ab') 2 fragment alone (p< 0.00193) and all of the anttbody-DNase I 
constructs kill significantly more cells than antibody alone (p <0.00783), 
except for perhaps pAS34 (p < 0.021). 

20 (F) Use of the DNase-I/huHMFG-1 Fab fusio n protein in the treatment 
of ovarian cancer 

Patients diagnosed with ovarian cancer are treated by intravenous injection 
of the DNasel/huHMFG- 1 Fab fusion protein. Typically, a dose of 
25 between 1 to 100 mg will be administered weekly. 

Therapeutic response is measured by the normal clinical procedures that 
are well Imown in tne art, for example radio-imaging methods. 
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CLAIMS 

1 . A compound comprising a target cell-specific portion and a cytotoxic 
portion characterised in that: 

(i) the target cell-specific portion comprises an humanised monoclonal 
antibody having specificity for polymorphic epithelial mucin (PEM), 
or an antigen binding fragment thereof; and 

(ii) the cytotoxic portion has endonucleolytic activity. 

2. A compound according to Claim 1 wherein the target cell-specific 
portion comprises an humanised HMFG-1 antibody or an antigen 
binding fragment thereof. 

3. A compound according to Claim 2 wherein the target cell-specific 
portion is an humanised HMFG-1 antibody. 

4. A compound according to Claim 1 or 2 wherein the target cell-specific 
portion comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, 
such as Fab and F(ab') 2 , Fv molecules, disulphide-linked Fv 
molecules, ScFv molecules and single domain antibodies (dAbs). 

5. A compound according to Claim 4 wherein the target cell-specific 
portion comprises a Fab molecule. 

6. A compound according to Claim 4 wherein the target cell-specific 
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7. 



8. 



9. 



10. 



portion comprises a Ftab'k molecule. 

A compound according to Claim 1 wherein the target cell-specific 
portion comprises at last part of one or both of the amino acid 
sequences of Figure 3 . 

A compound according to Claim 7 wherein the target cell-specific 
portion comprises both of the amino acid sequences of Figure 3 . 

A compound according to any one of Claims 1 to 8 wherein the 
cytotoxic portion has DNA endonucleolytic activity. 

A compound according to Claim 9 wherein the cytotoxic portion is at 
least the catalytically active portion of a DNA endonuclease. 



11. A compound according to Claim 10 wherein the endonuclease is a 
mammalian deoxyribonuclease I. 

12. A compound according to Claim 11 wherein the endonuclease is a 
human deoxyribonuclease 1. 

13. A compound according to Claim 1 wherein the endonuclease is a 
restriction endonuclease. 

14. A compound according to Claim 10 wherein the cytotoxic portion 
comprises the amino acid sequence of Figure 2. 

15. A compound according to any one of Claims 1 to 14 wherein a nuclear 
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localization signal is incorporated. 

16. A compound according to Claim 15 wherein the nuclear localization 
signal comprises the sequence PKKKRKV . 

17. A compound according to any one of Claims 1 to 16 wherein the 
target cell-specific portion and the cytotoxic portion are fused. 

18. A compound according to Claim 17 wherein the target cell-specific 
portion and the cytotoxic portion are separated by a lniker sequence. 

19. A compound according to Claim 18 wherein the linker sequence is or 
comprises GG or GSGG 

20. A nucleic acid encoding a compound as defined in any one of Claims 
17 to 19. 

21. A vector comprising a nucleic acid according to Claim 20. 

22. A host cell comprising a vector according to Claim 21 

23 . A pharmaceutical composition comprising a compound according to 
any one of Qaims 1 to 19 and a pharmaceutical^ acceptable carrier. 

24. A compound according to amy one of Claims 1 to 19 for use in 
medicine. 



25 



Use of a compound according to any one of Claims 1 to 19 in the 
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preparation of a medicament for treating a mammal having said target 
cells to be destroyed. 

26. A method of treating a mammal having target cells to be destroyed, the 
method comprising adniinistering a compound according to any one of 
Claims 1 to 19 to said mammal. 



27. 



27. 



A use according to Claim 25 or a method according to Claim 26 
wherein the mammal is a human. 

A use according to Claim 25 or a method according to Claim 26 
wherein the target cells to be destroyed are cancer cells. 



28. A use or a method according to Claim 27 wherein the cancer cells are 
epithelial cancer cells. 

29. A use or a method according to Claim 28 wherein the cancer cells are 
ovarian, gastric, colorectal and/or pancreatic cancer cells. 

30. A use or a method according to Claim 29 wherein the cancer cells are 
are ovarian cancer cells. 

31. A compound substantially as described herein, preferably with 
reference to one or more of the accompanying figures. 
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ABSTRACT 



Com pounds for Tareetim 

A compound comprising a target ceU-specific portion and a cytotoxic portion 
characterised in that flie target ceU-specific portion comprises an humanised 
monoclonal antibody having specificity for polymorphic epithelial mucin 
(PEM), or an antigen binding fragment thereof, and the cytotoxic portion has 
endonucleolytic activity. Preferably, the target cell-specific portion 
comprises an humanised HMFG-1 antibody or an antigen binding fragment 
thereof. Advantageously, the cytotoxic portion is at least the catalytically 
active portion of a DNA endonuclease, e.g. a human DNA endonuclease I. 
The invention further provides nucleic acids encoding the compounds of the 
invention, and the use of such compounds in medicine, e.g. in the treatment 

of cancer. ? 
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PRI 



VERSION 

KEYWORDS 
SOURCE 



06-MAR-1995 



^?f SI0N S^3 mRNA ' con >P^te c ds 

' M55983.1 GI. -181623 

DNase I. 

psssr^s:; 2= — - «™. 

1 (b a3es i to 2033, * fnini ' Hoiranidae; Homo. J,la ' 
Shak,S., Capon, D. a , Spll . 

Recombinant human W.J J"J2±: "*"t erS/S . A . and 

Acad . . 3 the viscosity of ^t\ e c r ' f c : b L ; osis 

noejjrz Acad - Sc - 87 (23) , 919e _ 9192 (199o) 

Location/Quali-fiers 
1..1039 

/clone="hDNase-18-Z» 
/tis S ue_ type=npancreas „ 

/gene= ,, DNase I 
160. . 1008 

/gene="DNase I 
/ codon_s t ar t «1 

/product="DNase I" 

/protein_id-»AAft6317p . 1 « 
/dbxref « »GI ; 181624^ 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



CDS 



mar pe prirfo 



260. . 1008 

/gene»"DNase I" 
226. . 1005 

/gene-"DWase x n 
/proauct=»DNase i" 

305 c 282 g 



BASE COUNT 

« JS?t??:j ^: gtgcctt »«srtgcttc 

121 cattctcrtc ??T? Ma * 9«?»eattg 

ioi au ^cT:cgt:c atctctgago acafoa^^f 
131 ggggcgctgc tggcact^r * cat <=accat 
. 241 ttcaa Cat c c affcat^ ?9CCCtQct 9 
301 gtgcaaafeJ a ^catttgg gga gaccaa 

?s ssS 35 

- «i js ct9owt «"***io 

601 aggSattS tca *= c ^** ^ gccatt 

721 atggcrcqact J™* SMtgtccaa 
791 ctSSSJS ct *"***t 
841 cccacgcact JtJccJataJ « agt ^ta 
901 gttcceera^ * r 9 c «atga caggatcgtg 
gttcccgact cggctcttcc ctttaacttc 



226 t 



ttcagagacc 
tcatcaaagg 

catctcagga 
cagggg gcC g 

atgtccaatg 
ctggtccagg 

aatcaggatg 

tataaggag C 
tactacgatg 
? tca ggttct 

gccccggggg 

gagaaatggg 

gtgagaccct 

atccccgaca 
gttgcaggga 

caggctgcct 



tuccrcata 
atat tccaga 
tgaggggcat 
tgtccctgaa 
ccaccctcgt 
a ggtcagaga 
caccagacac 
gctacotgtt 

atggctgcga 
tctoccggtt 

acgcagtagc 
gcttggagga 
Gccagtggtc 
acgctoacac 
tgctgctccg 

atggcctgag 



gacta.cctct 
ttcttgacag 

gaagctgctg 

c agctacatt 
cagccacctg 

ctatcactac 
cgtgtacagg 

goootgcggg 

c gagatcgac 
cgtcatgttg 
atccatccgc 
cacagctaca 
aggcgccgtt 
tgaccaactg 
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961 gcccaagcca tcagtgacca ctatccagtg gaggtgatgc tgaagtgagc agcccctccc 
1021 cacaccagtt gaactgcag 
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ACCESSION 

SOURCE 

ORGANISM 



TITLE 
FEATURES 

lourcQ 



7B3 bp mRNA 
raRNA, complete cda, 



PHI 06-ttM*-i995 
Wacujrc sequence pioaified to remove Nairl 



COS 



0Na.se I, 
Homo sapiens 

vo£?3E!^ ^Z 1 **™**^} eaio ^^; Mecsxoa; ChortSatA; 

I S^w?' Prl4nat « S ; c a «rrtl„i f HomAnida^ H«w. 

C fP°G,D.a w Hellmi**,*., Marstcrc, S.A. AAd (take*, C X, 
B^C ? rcbta«t huil^ DNa_se I reduces the viscosity of c^tiS'^e,!. 

SISS-J^* 1 -- A " d - Sci - 31*8-9192 (1930) 

I*»«at /Qua 1 i f i 0 C5 
X. .1039 

/clone* *hDt*Me- 16-3," 
160. .225 

ICQ. .1008 

/codon_€t & am = l 
ZdtL-X^f* ,l PID:glSi624" 

iso_ .no do 



BASE COUNT 
ORIGIN 



226. .1005 
166 a 226 e 520 g 



1S9 t 



1 CIGAAGATCG 
Gl CTCGTCAGCT 
122 AGAGACAGCC 
1GI OACACCTATC 
241 CXGT3TCaTCT 
301 TCCGACCCCT 
361 CCGTTCACAC 
421 CTACCCCACA 
m GACGACGTCA 
541 TGGTCATCCA 
$01 GACACCAC»6 
661 CXCCGAGCCC 
721 CTCAGTGACC 
781 TGA 



CAGCCTTCAA 
ACATTGTGCA 

ACCTGACTGC 
ACTACGTOOT 
ACAGGCCTGA 
GCGGGAACGA 
A GC XC A GGG A 
TCGACGCTCT 
TGTTGAXGGG 
TCCGCCTGTG 
CTACACCCAC 
CCCTTCTTOC 
AACTGGCCCA 



CATCCAGACA 

GXTCCTGAGC 

CGTGGGGAAC 

CAGTGAGCCA 

CCAGGTCTCT 

CACCTTCAAC 

CTTTGCCATT 

CTATGACGTC 

CGACTTCAAT 

GACAAGCCCC 

GCACTGTGCC 

CGACTrGGCT 

AGCCATCAGT 



TTTGGGGAGA 

CGCTACGACA 

CTGCTGGACA 

i=TGGi?ftCGGA 

GCGGTGCACA 

CGAGAQCCAG 

GTTCCCCTGO 

TACCTGGATG 

GCGGGCTGCA 

ACCTTCCAGT 

TAXGACAGGA 

CTTCCCTTTA 

GACCACTATC 



CCAAGATETC 
TCGCCCTGGT 
ACCTCftATCA 
ACA^CTATAA 
G CT ACT ACT A 
CCATTGTCAG 
AXCCCCCCCC 
XCCAACAGAA 

CCTATGTCAC 
GGCTGATCCC 

TCGTGGTTGC 
ACTTCCAGrac 
CAGTGGAGQT 



CAATGCCWCC 
CCAGGAGGTC 

CGACGCACCA 

GGAGGGCTAC 

CGATGATX3GC 

GTTCT7 C?dC 

COGGGACCCA 

AXGGGGCTTG 

ACCCroCCAG 

CGftCAGCGCT 

AGGOATGCTG 

TGCCTATGGC 

GATCCTCZAAC: 
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LOCUS 

DEFINITION 
ACCESSION . 

DRGAK2SW 
REFERENCE 
ACTHOR5 
xXTLE 

cghkewt 

SITES 



HMFG1LC2.D 721 bp OWA 

TOWANISED BMT«1 tilGKJ CMAIK Vnp LBADJSK, 



1 <BASES 1 TO 342 > 

C0W3XWCTIOK OP RESHAPED HMFG1 £TC 
rHMOWOii. C1S93) J7B„ 364-370 

SCANNED XW CTROM JOOFLNAL 



Tbis ifi tfc« *«3U*n«ft of tk* HMFG1 JLight chain qcn c with r*- 

I2 P ^"? cr e ^ uenQ ^ attach^, Traiuiar* froS * r * 

iii"—^ in t > A in all c lon M leading 

to fU33 silent mutation <t ± n verhoeyen »a» cr ) 9 



BASE COUWT 
ORIGTN 



{.oven 



197 



202 c 



162 9 



140 t 



1 ATGGGATGGA 
€1 ASCCAQA'TCA 
121 ACCTGTAAGT 
1ft 1 TACCAGCAGA 
241 TCTGGTCTGC 
301 AGCAGCCTCC 

361 CCGACGTTCG 
421 TTCATCTTCC 
491 CTGAATAACT 
541 TCCCCTAaCT 
601 AGCAGCACCC 
6€1 OTGACCGATC 
721 A 



CCTCTATCA1 
CCCAOAQCCC 
OCAGTCAGAG 

AGCCAGGTAA 
CAAGCtVGATT 

AGCCACAGGA 
GCCAASGGAC 
CGCCATCTCft 

tCTATCCCAG 
CCCAGGAGAG 

TGACGCTQAG 
ACOGCCTGAG 



CCTCTTCrXG 

AAGCAGCCTG 

CCTTTTATAT 

©GCTCCAAAO 

CAGCQ0TAOC 

CATCGCCACC 

CAAGCTCCAA 

TGAGCAGTTC 

AGAGGCCAAA 

TGTCACAGAG 

CAAAGCAGAC 

CTCGCCCGTC 



GTACCAACAG 
ACCGCCAGCC 
ACTACCAATC 
CTGCTGATCT 
GGTAGCCCTA 

TACTACTGCC 

ATCAAACCAA 
AAATCXGGAA 
CTACAGTCCA 
CAGGACAGCA 
TACGAGAAAC 
ACAAAGAGCT 



CTACAGGTTjX 

TGGGTGAOU3 

AAAAC5ATCTA 

ACTGGGCA.TC 

CCGACTTCAC 

AGCAATAXTA 

C,T CTCGCTSC 

CTGCCTCTCT 
AGGTCGATAA 
AGGACACCAC 
ACAAAGTCTA 
TCAACAC3GGG 



CTXGGCCIGXS 
CACTAGGGAA 
CTTCACCATC 
TAGATATCOT 
ACCATCTGTC 

TGTGTGCCTG 
CGCCCTCCAA 
CTACAGCCTC 

CGCCTQCGAA 
AGAGTGTTAG 
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ACCESSION 

KEYWORDS 

SOOKC3T 

KBFSOTNCE 
TITLE 

FEATURES 



1404 bp DCT. 
gPMMgjSZD HMFG1 tmav y chain 

■BHrmriB ■ — til 




BASE CdUwr 
Oft 1X3 IN 



V TO A) 

clone* 



VERH0B*BN M At 

Y? SCANNED Zft FROM »TOURNAj> 

No*© ' T lrt adi.ncf to silent awtation in 

«1 ^Sllxl «™MW CCftCT^kM 

1?1 TGCAftGGTTT CTGCCTACftC CTTeSISS £££^25 GGSCCTCXflT GAAQ6TOTCC 

lM ggaaasggcc tcwgwm? nram? ^l^T 3 ^ CC **^CCA 

S4i GAGAAarTCA ASGSCC6AST GACAOTCACT Aca?SS^ CTAATAATTC TAGATACAAT 
301 BACJCTCAGCA GCCTGAGCTC ToS^ G^t£££ JS^S™ 5 ASCCTACATC 
361 TTTGCCTGar TTCCTTACTG BGQCCAxS^ AfTGTGCAAG ArCCTACQAC 

«1 AAGGCCCCAT CGSTCTTCCC CCTGGCACCC tS?£??« SflST 50 *" 1 *»e*CCiiCc 
«1 OTOCIOQSCI GCCTGGTCA* GGACTflCTTC rr^^^S °CACCTCTGG GGGCACXgcg 
§41 GGCGCCCTGA CCACGCOCST CcKcIc^c eee^^S JSACGQTtSTC GTGGAACTCX 
601 TCCCTCAGCA GCGTGGTt^r ^rGCCCTCe *££?S$!S C TfcC RGTCCTC AGGACTCTAC 
661 AACCTRAAfn ACAARCCCAG S^tISS^ i5Si GCTIGG GCACCCAGAC CTACATCTCC 

TT lULVHiC CCCCAAAACC CAACG^UAcc cr^^^ TCCTGQCGGG ACCCTCACTC 
A41 TGCGTWTGG TGGACGTGAG eoSSS ^$S° RCCCC TGAGQTCACA 

901 TCCGTGGAGC TGCATAATGC CAAGACAAAG «S™ ^IJ?^ GTa GTACCT6CAC 
9«1 CCXCrBSTCA flCOTRCTCAi: CCTTCCTGCfiC Srr^r?^ ACCAGTACAA CAGGACGTAO 
1021 IGGAAGGTCT CCAACAAACC CCTOfiC^cc rCaA TGGCAA GGAGTACAAG 

10B1 GGGCAGCCCC CAGAACCArA GGTGTACACC ^^TCTC CaAAGCCAAA 

11*1 AACCAGGTCA GCCTGACCTG CCTGGTCAaS ££?5?£S5 CCCGCQatga GcrfiACCAA© 

laoi tgg^agc^ -tccocagcc coacaacaac m2SS?S S2^ ACAt cgccgtcgag 

1^61 CACGGCTCCT TCTTCCTCTA CAGCAACCTr s?£-£2™S? CGCCTCCCGT GCTGGACTCC 
1321 AACCTCTTCT CATCCTcSt SESam™ gSSS^ A «GCAGGTG GCAGCAGCGG 
lSoi CTCTCCCTGT CTCCGOST flA ffi aA GCTCTGCACA ACCACTACftC GCACAAGAGC 
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M pAS23 



LOCUS PAS23.DNA 1554 bp mRNA PRi 06-MAR-1QQS 

ACCESSION^ Humanised HMFG1 Fab ' 2 fused to human DNase I (conaLurt l} 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified r*/f 

MHDNASE1 . dna) y itmis raoairiea c/f 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata- 
^v^"^ Eutheria; P^tes; catarrhini; Hominidae; Homo.. 

t™** 5 T 't' Ca P° n ' D - J " Hellmi SSrR ., M ar S ters,S.A. and Ba*er,C.L. 

Recombinant human DNase I reduces the viscosity of cystic fibrosis 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9163-9192 C1990) 
MEDLINE 91067672 U3SD ' 

BASE COUNT 344 a 468 c 434 a 308 t 

ORIGIN * 

* ATGGGATGGA GCTGTATCAT CCTCTTCTTG G TAG CAAC AG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
,1, ^™^ CTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
If} ° A ^r CA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACcac 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTC-ACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCAcIgCG 
i*} GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
6Q1 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
fabl AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTtpt 
iVi rr^ CTC ACACATGCCG ACCGTGCCCA GCACCTGA A6 ggggggg 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
li} TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
o« ^ TGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCtS 

noJ ^l^l GG TCAGTGAGCC actgggacgg aacagctata aggagcgcta cctgttStc 

1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
ilii GA ^^ AGGG AGTTTGCCA * TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
12 61 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
GCTACACCCA CGCACTGTGC C TAT GACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACT T CCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GTGA 



// 



v 



2fr 
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. File : PAS23 . DNA - 

Range : 1 - 1554 Mode : Normal r\£l\t€ ^ 

v / coflon Table : un i vprqai 9 'VvW; O 




9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



M G W 



V H 



63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKK&CAS 

117 126 135 144 153 162 

CTG AAC CTC TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

Y K V S C KASGYTF S A ~ Y W I E 

171 180 169 138 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

W V R Q A PGKGLEWVGEI L P 

225 234 243 252 251 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

gsnks rynekfkgrvtvt 

288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RD T STNTAYMELSSLRSE 

333 342 351 360 369 379 

GAC ACA CCC CTC TAT TAC TCT CCA ACA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

337 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W G Q G T 1> V 



K 



441 450 459 468 477 4B6 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF PLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V K 



V S W N 



549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



G A T S G V H T F P A V D Q S 'JO 

- 1 - 
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603 . 612 "I 630 639 648 
CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 
i — 

L Y S L 5 s V V T V P S S S L G T ~Q~ 

557 6 S6 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPS NTKVDKK 

711 720 729 7 38 747 75S 

GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

V E P K S = D K T H T C P p" V ~p" "a" "p" 

™„ »•„ ™ 783 7 " 801 810 

GAA (GGG AGC GGC GGGf CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 

' BGSGGl ' K:i: AAFNIQTFGB 

819 828 337 g46 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 

T K M 5 '» a t l v s y i v q i" "l" 7 "r" 

873 882 391 900 90 9 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 

y dia i>vq evrdshltavg" 

527 336 945 954 953 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 

KLL DWLNQDAPDTYHYv'C 

981 990 999 100B 1017 1Q2S 

AGT GAC CCA CTC CCA CGC AAC ACC TAT AAG GAG CGC TAC CTG TTC GTC TAC ACC 

Seplg *nsykeryi j fv~y~"b" 

1035 1044 1053 1062 1071 10eo 

C " ™ ^ AGC TAC TftC TAC GAT GAT GGC TGC GAG CCC 

PDQ VSAVDSYYYDDGCBP 

1089 1098 1107 lug 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 

CG NDT FNREPAIVR K F 7 7 

1143 1152 H61 ^ 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 

F T E V * E P A I V P L H T T "p" T "d" 

1197 1206 1215 1224 123 3 1242 " 

-2- 
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GCA GTA GCC GAG AT C GAC QCT CTC TAT SAC GTC TAC CTG GAT GTC CAA GAG AAA 




A 


V A 




I D 


A 


L y 


D 


V Y 


L 


ti V 


Q 


5 K 


TGG 


1251 
GGC TTG 


GAG 


1260 
GAC GTC 


ATG 


1253 
TTG ATG 


GGC 


1278 


O-ri. J, 


1287 


TGC 


1296 
AGC TAT 


W 


G L 


E 


D V 


M 


L M 


G 


D F 


N 


A G 


C 


s v 


GTG 


1305 
AG A CCC 


TCC 


1314 
CAG TGG 


TCA 


1323 
TCC ATP 




1332 


ACA 


1341 
AGC CCC 


ACC 


1350 
TTC CAG 


V 


R P 


S 


Q W 


s 


s X 


R 


L - W 


T 


S P 


T 


F Q 


TGG 


1359 
CTG ATC 


CCC 


1368 
GAC AGC 


GCT 


1377 
GAC ACC 


ACA 


1366 
GCT ACA 


CCC 


1395 
ACG CAC 


TCT 


1404 
CCC TAT 



w 



T T 



H 



1413 1422 1431 1440 1449 145 8 

GAC AGO ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG 



R I V 
1467 



V A G 
1476 



M L L 
1485 



v v 



1494 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 



A L P F 

J 1521 



1530 1539 1548 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



D H 



V 



V M 



0056 



K 



-3- 
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11057 




LOCUS 



PAS27.DNA 



fO PAS27 



1584 bp 



DEFINITION Humanised HMFG1 Fab^2 fSfto human DNasfl with SV^S (conduct 

ACCESSION 
NID 

KEYWORDS DNase I. 

MSDNASEl.dn a r aSe 1 is fr ° m « aen bl B d. oligos (thus modified c/f 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
5hak,S., Capon,D. J., Hellaiss, R. , Marsters, S , A. and Baker, C.L 
JjatS^ ^ DNaSS 1 reduces the viscosity of cystic fibrosis 

Proc Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (19901 
91067672 

354 a 474 c 446 g 310 t 



AUTHORS 
TITLE 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



// 



1 ATGGGATGGA 
61 GTGCAGCTGG 
121 TGCAAGGCTT 
181 GGAAAGGGCC 

2 41 GAGAAGTTCA 
301 GAGCTCAGCA 

3 61 TTTGCCTGGT 
.4 21 AAGGGCCCAT 

4 31 GCCCTGGGCT 

541 GGCGCCCTGA 
601 TCCCTCAGCA 
661 AACGTGAATC 
721 GACAAAACTC 
791 GCAGCCTTCA 
8 41 T AC AT TG TCC 
901 C ACC TG AC T G 
961 CACTACGTGG 

1021 TACAGGCCTG 
1081 TGCGGGAACG 
1141 GAGGTCAGGG 
1201 ATCGACGCTC 
1261 ATGTTGATGG 
1321 ATCCGCCTGT 
1381 GCTACACCCA 
1441 GCCGTTGTTC 
1501 CAACTGGCCC 
1561 CCCAAAAAGA 



GCTGTATCAT 
TGCAGTCTOC5 

CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAUCCCJAC3 
ACACATGCCC 
AC AT CC AGAC 
AGATCCTGAG 
CCGTGGGGAA 
TCAGTGAGCC 
ACCAGGTGTC 
ACACCTTCAA 
AGTTTGCCAT 
TCTATGACGT 
GCGACTTCAA 
GGACAAGCCC 
CGCACTGTGC 
CCGACTCGGC 

AAGCCATCAG 
AGCGCAAGGT 



CCTCTTCTTG 

GGCAGAGGTG 

CTTCAGTC5CC 

CGGAGAGATT 

GACAGTCACT 

TGAGGACACA 

GGGCCAAGGG 

CCTGGCACCC 

GGACTACTTC 

GCACACCTTC 

CGTGCCCTCC 

CAACACCAAG 

ACCGTGCCCA 

ATTTGGGGAG 

CCGCTACGAC 

GCTGCTGGAC 

ACTGGGACGG 

TGCGGTGGAC 

CCGAGAGCCA 

TGTTCCCCTG 

CTACCTGGAT 

TGCGGGCTGC 

CACCTTCCAG 

CTATGACAGG 

TCTTCCCTTT 

TGACCACTAT 
TTGA 



GTAGCAACAG 
AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAC5A 
CCCGAACCGG 
CCGGCTGTCC 

AGCAGCTTGG 
GTGGACAAGA 
GCACCTGAAG_ 
ACCAAGATGT 
ATCGCCCTGG 
AACCTCAATC 

AACAGCTATA 
AGCTACTACT 
GCCATTGTCA 
CATGCGGCCC 
GTCCAAGAGA 
AGCTATGTGA 
TGGCTGATCC 
ATCGTGGTTG 
AACTTCCAGG 
CCAGTGGAGG 



CTACAGGTGT 

GGGCCTCAGT 

AGTGGGTGCG 

GTAATAATTC 

CCACAAACAC 

ACTGTGCAAG 

CAGTCTCCTC 

GCACCTCTGG 

TGACGGTGTC 

TACAGTCCTC 

GCACCCAGAC 

AAGTTGACCC 

GGAGCGGCGG 



CCAATGCCAC 
T C C AG GAG G T 
AGGACGCACC 
AGGAGCGCTA 
ACGATGATGG 
GGTTCTTCTC 
CGGGGGACGC 
AATGGGGCTT 
GACCCTCCCA 
CCGACAGCGC 
CAGGGATGCT 
CTGCCTATGG 
TGATGCTGAA 



CCACTCCCAG 
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 

AG G AC T C T AC 
CTACATCTGC 
CAAATCTTGT 
J3CTGAAGATC 
CCTCGTCAGC 
C A GAG AC AG C 
AGACACCTAT 
CCTGTTCGTG 
CTGCGAGCCC 
CCGGTTCACA 
AGTAGCCGAG 
GGAGCACGTC 
GTGGTCATCC 
TGACACCACA 
GCTCCGAGGG 
CCTGAGTGAC 
GGGGGGCGGA 
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' Fil« .- PAS27.DNA 

5?2 ge 4, i 7 1584 Mode : Normal wrr / 

N — Codon Table : Universal YVWkE, b 




I*) 



9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



V H 



63 72 81 go 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAa' AAG CCT GGG GCC TCA 



SQVQLVQS 



V K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



V K V S C 



W 



171 ISO 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQ APGK 



W 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



G 


S 


N 


N 


S 


R 


Y 


N 


E 


K 


F 


K 


G 


R 


V 


T 


V 


T 


ACA 


CAC 


279 
ACA 


TCC 


ACA 


288 

AAC 


ACA 


CCC 


297 
TAC 


ATC 


CAC 


306 

CTC 


ACC 


ACC 


315 
CTC 


ACC 


TCT 


324 
CAC 


R 


D 


T 


5 


T 


N 


T 


A 


y 


M 


E 


L 


S 


s 


L 


R 


S 


E 


GAC 


ACA 


333 
GCC 


GTC 


TAT 


342 
TAC 


TGT 


GCA 


351 
AGA 


TCC 


TAC 


360 
GAC 


TTT 


GCC 


369 
TGG 


TTT 


GCT 


378 
TAC 



T A V V Y C 



W 



387 396 405 414 423 ' 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



w 


G 


Q 


G 


T 


L 


V 


T 


V 


S 


S 


A 


5 


T 


K 


G 


P 


s 


GTC 


TTC 


441 
CCC 


CTG 


GCA 


450 
CCC 


TCC 


TCC 


459 
AAG 


AGC 


ACC 


468 
TCT 


GGG 


GGC 


477 
ACA 


GCG 


GCC 


486 
CTG 


V 


F 


p 


L 


A 


P 


S 


S 


K 


S 


T 


S 


G 


G 


T 


A 


A 


L 


GGC 


TGC 


435 
CTG 


GTC 


AAG 


504 
GAC 


TAC 


TTC 


513 
CCC 


GAA 


CCG 


522 
GTG 


ACG 


GTG 


531 
TCG 


TGG 


AAC 


540 
TCA 


G 


C 


L 


V 


K 


D 


Y 


F 


P 


E 


P 


V 


T 


V 


S 


■ W 


N 


S 


GGC 


GCC 


549 
CTG 


ACC 


AGC 


558 
GGC 


GTG 


CAC 


567 
ACC 


TTC 


CCG 


576 
GCT 


GTC 


CTA 


585 
CAC 


TCC 


TCA 


594 
CCA 



VH TFPAV LQSSG 
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. 603 512 "1 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYS LSSVVTVPSSST. a n> rs 



657 SS6 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



711 



M V M H 



720 



K 



D K 



K 



729 738 ?47 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



K 



K 



H 



755 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 



K 



819 S28 937 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 



M 



V 



873 882 891 900 90S 318 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 



A 



927 



V Q 



936 



V 



945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



Q 



D 



H 



V 



981 



990 



999 10D8 1017 - 1025 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



K 



V 



1035 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 



1089 1098 1107 1H6 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 



jsr D 



N 



A 



V R 



1143 



1152 



1161 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT CCC CCC CCC CCC CAC 



H 



1197 



1206 



1215 



1224 
-2- 



1233 



1242 
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GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 




A 


V A 


E 


I D 


A 


L Y 


D 


V Y 


It 


D V 


Q 


E K 


TGG 


1251 
GGC TTG 


GAG 


1260 

GAC GTC 


ATG 


1269 

TTG ATfi 




1278 


AAT 


1287 
GCG GGC 


TGC 


1296 
AGC TAT 


W 


G ti 


E 


D V 


M 


L M 


G 


D F 


N 


A G 


C 


S V 




1305. 
AGA CCC 


TCC 


1314 
CAG TGG 


TCA 


1323 
TCC ATC 




1332 




1341 

AGC CCC 


ACC 


1350 
TTC CAG 


V 


R P 


S 


Q W 


S 


S I 


R 


L W 


T 


S P 


T 


F Q 


TGG 


13S9 
CTG ATC 


CCC 


136a 

GAC AGC 


GCT 


1377 
GAC ACC 


ACA 


1386 
GCT ACA 


CCC 


ACG CAC 


TGT 


1404 
GCC TAT 



W L I P 
1413 



D . S A D 
1422 



H 



1440 1449 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTC CTC CCA CCC CCC CTT GTT CCC GAC TCG 



D JR. I V 
1467 



M 



A V 



V 



1476 



1485 1454 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 



KT 



0 



D 



1521 1530 1539 1548 1557 lSfig 

GCC ATC AGT GAC CAC TAT CCA CTC CAC CTC ATC CTQ AAG GGC GGC GGA CCC AAA 



H 



1575 15S4 
AAG AAG CGC AAG GTT TGA 3 



V 



V M 



K 



K 



K K 



K 



(2)060 



-3 
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(A) PAS34 



SS^r-nr™ PAS34 ' D NA 2196 bp ' 2196 bp 2196 bp DNA 14-AUG-1998 

SSSS J?^ 1 ?? - H ! P ? 1 heaVY Chaln fUS6d t0 h ™ ^se construct 34 

DEFINITION Clone 16.4.2 (same as hcdnasel . dna template file! 

REFERENCE ' 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) : 78, 364-370 

(Sase.dnar 311 " a rMult ° f 0li *° "»« 

SSSESU off made U3ing flapping oligos AS79 and AS80 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I E V TO Al 

"SSS 3 5ole dUe 13 G > T leading t0 SilSnt -ratlin in all cJL 

BASE COUNT 501 a 677 c 607 a 411 t 

ORIGIN y 



5 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG 5TAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
Hi I^ CAAGGCTT "GGCTACAC CTTCAGTGCC T AC T GG AT AG AGTGGGTGCG CCAGgSJS 

ll\ GGAAAGGGCC TCGAGTGGGT cggagagatt TTACCTGGAA GTAATAATTC TAGATACAAT 

lei GA ? AAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCcSSS 

TTTGCCTGGT TTGCTTACTG G GGCC AAGGG ACTCTGGTCA CAGTCTCCTC AGCCTcSJc 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG SgSSSScg 

J" rrrrl^l GCCTGGTCAft GGACTACTTC CCCGAACCGG TGACGGTGTC SSSSSS 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGQCTGTCC TACASTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

121 ^ GCCCAG a^raccafto gtggacaaga aacttgaccc <»amotS 

721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 

781 TTCCTCTTCC CCCCAAAACC CAACGACACC CTCATGATCT CCCGGACCCC SgScaS 

!" ISS? 5 ™ TG ^TGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG £S££2£ 

901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 

10li S££E££ ^TGCAC CAGGACTGGC 5S2S££J S£S£SE 

lU^l TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTr rns nrwnv* * 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC "ctSSSS SSSSSSE 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT ScosSSa^ 
llll ^S GGCAGCC GGAGAAGAA C TACAAGAcS SSSS? gSgSScc 

"Si SSSSSS ^agcaagctc ^cgtggaca agagcaggtg gcagcagggg 

llll SScrrrSaa ^££^5 GCTCTGCACA ACCACTACAC GCAGAAGAGC 

77-7 CTCTCCCTGT CTCCGGGTAA AGGGAGCGGC_GGGCTGAAGA TCGCAGCCTT CAACATCCAG 
7 = n7 ™" TGGGG AGA CCAAGAT GTCCAATGCC ACCCTCGTCA GCTACAT^GT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCSTsSr 

1621 C^aSgSc SSJSISi J CAGGACGCA CCAG ^C? AtScSSt StCAGTGaI 

i Si S°^ GGGA ^ ggaacagcta taaggagcgc TACCTGTTCG TGTACAGGCC TGACCAGGTG 

™ ^TGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTT? 
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA GAGAGGT CAG GGaSSIcS 
1801 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGMcSSc TCTCtSSc 
iSJl ATGTCCAAGA GAA ^TGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 

1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAPP 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGaScCA CAGcSScC ScgSSgt 

2041 gcctatgaca ggatcgtggt tgcagggatg ctgctccgS gggccgSgt tcccSScg 
210! gctcttccct ttaacttcca ggctgcctat ggcctgagtg accaactggc ccSSSSc 
2161 agtgaccact atccagtgga ggtgatgctg aagtga ccaagccatc 
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File r PAS 3d . DNA 

Range : l - 219 6 Mode ; Normal 

Codon Table : Universal 



16 



27 



36 



* ATG GGA TGG AGC TOT ATC ATC CTC TTC TTC GTA GCA ACA CCT ACA GOT CTC CAC 



W 



63 



A 



V H 



72 



81 90 99 108 

^ TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



V 



K K 



117 126 135 144 * 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



V 



171 



W 



180 



189 153 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



w v 



wr v 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



279 



N 



K 



K 



V 



V 



238 



297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



©062 



M E 



333 3 42 3S1 360 369 376 

CAC ACA CCC CTC TAT TAC TCT CCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

D T A V Y Y C A R S Y D F A W " V ~A~ V 

387 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W 



V 



V 



441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA QCG GCC CTG 



K 



A 



495 S0 « S13 522 531 540 

CCC TCC CTC CTC AAC CAC TAC TTC CCC CAA CCC CTC ACC CTC TCC TCC AAC TCA 



V K 



W N 



549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



V H 



V 



503 612 621 630 S3S> 643 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



65"? 



666 



675 



684 
- 1 - 



693 



702 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

' TYICNVNH-KPSNTKVDKK 

7 H 720 725 733 747 755 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEFKSCDKTHTCFPCPAF 

765 77 * 793 792 ©01 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC WC CCC CCA AAA CCC AAG GAC ACC 

ELIiGGPSVFLtFP FKPKDT 

919 32B 337 adG 8S5 964 

CTC ATG ATC TCC CGG ACC CCT GAG OTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



MI SRTPEVTCVV 



H 



873 B62 331 900 ggg g iB 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

ED PEVKPNWVVDGVEVHJJ 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKpRSBQYNSTYRVVS 

"1 990 999 1008 101*7 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VL TVLKQDWLNG.KEYKCK 

1035 10'44 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAPIBKTI SKAK 

10 63 109S 1107 H16 H25 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQ^KEPQVYTLFFSRDBL 

1143 1152 1161 1170 1179 11SS 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TK.NQVSLTCLVKGFYPSD 

1137 1206 1215 122<l 1233 1242 

ATC OCC GT© OA© TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

FPV'LDSDGSFFLYSKI,TV 

1303 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKSRWQQGriVFSCSVMHE 



2- 
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1353 136B 1377 1386 1395 1404 

«GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG GCT AAA QGG 



H N 



H 



0 K 



1413 1422 1431 1440 1449 1458 

AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



K 



K 



1467 



1476 1485 1494 1503 1512 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



M 



N 



V 



1S21 1530 1539 1540 15S7 1S66 

ATC CCC CTC CTC CAG CAC CTC AGA CAC ACC CAC CTG ACT GCC GTG GGG AAG CTC 



A ' L 



V 



V 



R D 



H 



V 



1575 1593 1602 1611 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



N 



H 



V V 



lg 29 1638 1647 1656 1665 1674 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



R N 



K 



V 



1683 1692 1701 1710 1719 1728 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



V 



1737 



1746 



1755 1764 1773 1732 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 



A 



T 



1791 



1900 1809 1818 1827 1836 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



V 



H 



1845 1854. 1863 1872 1881 1890 

GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

AETD A LYDVYLDVQEKWG 

1899 1908 1317 1926 1935 1944 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC ACC TAT GTG AGA 



V M 



D 



N 



V 



1953 



19S2 1971 1360 1?8? X33B 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 



w 



w 



2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 



K 



A 



3- 
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( 




2061 20*70 2079 2088 2097 ■ 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 



X V V A G MLLRGAVVP D 3 A X, 

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GCC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQAAYGLSDQLAQA X - 

2169 2178 2187 2196 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

SDKYPVEVMLK* 
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(A) pAS35 



LOCUS PAS35.DNA 2193 bp 2193 bp DNA 14-AUG-iq<5R 

SSSfSS ™ mm ™ D ™ FG1 heav V chain to human DNAse construct 35 

DEFINITION Clone 17.12.1 with silent K to EC mutation (1398 A > G) 
REFERENCE 1 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 

^^fr»L ™ S ruj3ion was made using overlapping oligos AS81 and AS82 
FEATURES AA RESIDUE Z35 HAS NOT BEEN CHANGED TO KABAT (I.E V TO A) 

Res idue 963 is G > T leading to silent mutation in all clones 
FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 
SITES Note 

BASE COUNT 500 a 677 c 60© g 410 t; 

ORIGIN •> 

■ 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
SI GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181- GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
24 x GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
3 SI TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTCGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 CCCCTGGGCT GCCTGGTCAA GGACTAGTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AG G AC T C T AC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAC CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCPGTC 
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGAGCCC TGAGGTCACA 
8 41 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
i m™^ TCA GCGTCC TCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 

2 ^™^T CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1031 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
llii ^^ GTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGC GAC AT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
^S C ^ CT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAgSgGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGArr 
1381 CTCTCCCTGT CTCCGAA gGG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CA^SS 
1441 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTG? CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
"fj ^GGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GtSgCCaIS 

1801 gttcccctgc atgcggcccc gggggacgca gtagccgaga tcgacgctct ctatgacgtc 

1861 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTT^AAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GAC C AC TAT C CAGTGGAGGT GATGCTGAAG TGA 
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Mode 



ERIC POTTER CLARKSON 
Normal 



@067 



ATG 


GGA 


5 

TGG 


AGC 


TOT 


. la 

ATC 


ATC 


CTC 


27 
TTC 


TTG 


GTA 


36 
GCA 


* 

ACA 


GCT 


45 
ACA 


GGT 


GTC 


54 
CAC 


M 


G 


W 


s 


C 


I 


I 


L 


T 


h 


V 


A 


T 


A 


T 


G 


V 


H 


TCC 


CAG 


€3 
GTG 


CAG 


CTG 


72 
GTG 


CAG 


TCT 


ai 

GGG 


GCA 


GAG 


90 
GTG 


AAA 


AAG 


99 
CCT 


GGG 


GCC 


108 
TCA 


S 


Q 


V 


Q 


L 


V 


Q 


S 


G 


A 


£ 


V 


K 


K 


P 


G 


A 


s 


GTG 


AAG 


117 
GTG 


TCC 


TGC 


126 
AAG 


GCT 


TCT 


135 
GGC 


TAC 


ACC 


144 
TTC 


AGT 


GCC 


153 
TAC 


TGG 


ATA 


162 
GAG 


V 


K 


V 


5 


C 


K 


A 


S 


G 


Y 


T 


F 


S 


A 


Y 


W 


I 


B 


TGG 


GTG 


171 
CGC 


CAG 


GCT 


190 
CCA 


GGA 


AAG 


139 
GGC 


CTC 


GAG 


198 
TGG 


GTC 


GGA 


207 
GAG 


ATT 


TTA 


216 
CCT 


w 


V 


R 


Q 


A 


p 


c 


K • 


c 




K 


W 


V 


a 




I 




*P 


GGA 


AGT 


225 
AAT 


AAT 


TCT 


234 
AGA 


TAC 


AAT 


243 
GAG 


AAG 


TTC 


252 
AAG 


GGC 


CGA 


261 
GTG 


ACA 


GTC 


270 
ACT 


G 


S 


N 


N 


S 


R 


y 


N 


E 


K 


F 


K 


G 


R 


V 


T 


V 


T 


AGA 


GAC 


279 
ACA 


TCC 


ACA 


288 
AAC 


ACA 


GCC 


237 
TAC 


ATG 


GAG 


306 
CTC 


AGC 


AGC 


315 
CTG 


AGG 


TCT 


324 
GAG 


R 


D 


T 


S 


T 


N 


T 


A 


Y 




E 


Lt 


S 


s 


Li 


R 


5 


E 


GAC 


ACA 


333 
GCC 


GTC 


TAT 


342 
TAC 


TGT 


GCA 


351 
AGA 


TCC 


TAC 


360 
GAC 


TTT 


GCC 


3 69 
TCC 


TTT 


CCT 


378 
TAC 



T A V 

3B7 



R 



W 



396 405 410 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



w 


G 


Q 


G 


T 


L 


V 


T 


V 


S 


s 


A 


S 


T 


K 


G 


. ? 


S 






441 






450 






459 






468 






477 






486 


GTC 


TTC 


CCC 


CTG 


GCA 


CCC 


TCC 


TCC 


AAG 


AGC 


ACC 


TCT 


GGG 


GGC 


ACA 


GCG 


GCC 


CTG 


V 


F 


P 




A 


F 


s 


5 


K 


3 


T 


5 


G 


G 


T 


A 


A 








495 






504 






513 






522 






531 






540 


GGC 


TGC 


CTG 


GTC 


AAG 


GAC 


TAC 


TTC 


CCC 


CSAA 


CCC 


GTG 


ACG 


CTC 


TCC 


TCC 


AAC 


TCA 


G 


C 


L 


V 


K 


D 


Y 


F 


p 


E 


P 


V 


T 


V 


S 


W 


N 


S 






54 9 






55S 






567 






576 






585 






594 


GGC 


GCC 


CTG 


ACC 


AGC 


GGC 


GTG 


CAC 


ACC 


TTC 


CCG 


GCT 


GTC 


CTA 


CAG 


TCC 


TCA 


GGA 


G 


A 


L 


T 


S 


G 


V 


H 


T 


F 


P 


A 


V 


L 


Q 


s 


S 


G 






603 






612 






621 






630 






639 






648 


CTC 


TAC 


TCC 


CTC 


AGC 


AGC 


GTG 


GTG 


ACC 


GTG 


CCC 


TCC 


AGC 


AGC 


TTG 


GGC 


ACC 


CAG 






































Hi 


Y 


S 


L 


S 


S 


V 


V 


T 


V 


p 


S 


S 


S 




C 


T 


Q 






657 






666 






675 






684 






693 






702 
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ACC TAG ATC TCC AAC CTC AAT GAC AAC CCC ACC AAjC ACC AAC CTC CAC AAC . AJLA 




N V 



N 



H K 



N 



K V 



K K 



711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



@]068 



K 



K 



H 



765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELI/GGPSVFLFPFKPKDT 

819 829 837 845 955 . S€4 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC CTG CTC CTG GAC GTC ACC CAC 



M 



V V 



V 



H 



873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



V 



N W 



V 



V 



V H 



N 



927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



V 



981 990 999 1008 101*3 1036 

CTC CTC ACC CTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG . TAC AAG TGC AAG 



H 



W 



K 



K 



K 



1035 1044 1053 1062 1071 1030 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



N K 



K 



K 



K 



1089 1098 1107 1116 11Z5 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQ PREPQVYTLtP PS RDE U 

1143 1152* 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



K N 



V K 



1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



W 



N N 



K 



1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



G 



V 



V 



1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



K 



W 



N V 



V M 
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\ " 1359 1368 1377 1386 1395 1404 

: GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 



Igj069 




H N 



Y 



K 



K 



1413 1422 1431 1440 1443 1453 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG 



K 



1ST 



Q 



1467 1476 14B5 1494 1503 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 



V 



V 



1521 1530 1S39 1S4S 3,557 1566 

GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 



H 



T - A 



1575 1S84 1533 1602 1611 1620 

CAC AAC CTC AAT CAG GAC CCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 



A 



D T 



V V 



1629 1638 1S47 1655 1565 3,574 

CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG 



N 



K E 



V 



1683 1692 1701 1710 1719 172B 

GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 

VSAVDS YYYODGCEFCGN 

1737 17-46 ^ 1755 1764 1773 1782 

GAC ACC TTC AAC CCA CAC CCA GCC ATT GTC AGO TTC TTC TCC CGG TTC ACA GAG 



V 



T 



1791 1800 1809 1818 1827 1B3 6 

GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC 



V 



VREFAIVPLHAAPGDA 

1845 1854 1863 1872 18S1 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 



E 



V 



K 



W 



1899 190 9 1917 192 S 193 5 1944 

CAG CAC GTC ATC TTC ATC CCC CAC TTC AAT CCC CCC TCC ACC TAT CTC ACA CCC 



M 



M 



N 



19S3 1962 1971 1980 1939 1998 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 



W T 



W 



2007 2016 2025 2034 2043 2052 

CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 



T 



H 



-3 



i 
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Ig] 070 



r 




2061 



2070 



2079 



2086 



2097 



2106 



GTG, GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TC5 GCT CTT CCC 



V 



2115 2124 2133 21/12 3151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 



N 



D 



2169 2178 2187 

GAC CAC TAT CCA CTG GAC CTC ATC CTC AAC TCA 3 ' 



H 



v 



K 



-4 



2S 
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^ pAS36 



LOCUS 

DEFINITION 
DEFINITION 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



PAS36.DNA 2190 bp 2190 bp DNA 14-AUG-1998 
HUMANISED HMFG1 heavy chain fused to human DNAse - construct 3 6 
Clone 18,24.1 with residue 1392 T > C 

VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 
IMMUNOL. (1993) :78, 364-370 
Human DNAse sequence is modified as 



(mnctnase. dna) 



a result of oligo assembly 



COMMENT 
FEATURES 
FEATURES 
FEATURES 

SITES 
BASE COUNT 

ORIGIN 



Tne fusion was maae using overlapping oligos AS83 and AS8 4 

AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 



Residue 963 is G 

Residue 1392 T > 

Not 53 

4 98 a 673 



> T leading to silent mutation in all clones 
c silent S to S mutation 



605 g 



409 z 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAG CAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC T AG AT ACAA T 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCACCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CT CAT GAT C T CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGC AG T ACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAG TACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCCG GGGAG CGGCGGG CTG AAGATCGCAG CCTTCAACAT CCAGACATTT 
14 41 GGGGAGACCA AG AT G T CC AA TGCCACCCTC GT C AG C T AC A TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 G T GG AC AG C T ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA 

17 41 GAG CC AG C CA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 

18 01 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
18 61 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2 041 G AC A G GAT C G TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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v s Co don Table : Universal 




Mode 



ERIC POTTER CLARKS0N 
Normal 



9 18 21 36 4S SC. 

ATC CCA TOO ACC TCT ATC ATC CTC TTC TTC CTA CCA ACA CCT ACA COT CTC CAC 



M 



W 



V 



V H 



072 



63 72 31 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



V 



K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



K 



K 



W 



171 190 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



K 



W V 



225 234 . 243 252 251 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



N 



K 



K 



V 



V 



279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



333 342 351 3 €0 369 378 

CAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



V 



D 



W 



A 



387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W 



V 



V 



441 450 459 4SS 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



K 



A A 



A95 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V K 



V 



V 



W N 



549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGV HTFPAVLQSSG 
603 612 621 63D 639 648 

CTC TAC TCC CTC AGC AGC GTG GTC ACC CTC CCC TCC ACC ACC TTG GGC ACC CAG 



V V 



V 



Q 



657 



666 



67 5 



634 
- 1 - 



693 



702 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



T Y 1 CNVMHKPSNT.KVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



SLI/GG F5V-FLTFFKFK13T 

813 323 837 846 8S5 364 

CTC ATG ATC TCC CGG ACC CCT CAC CTC ACA TCC CTC CTC CTC CAC CTC ACC CAC 



LM I S R-TPEVTCVVVDVS H 

873 882 891, 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



B D P B V K F NWYVD C V E V H N 

927 936 9&S 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTYRVVS 

981 990 959 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLMQD-WLN..GKEYKCK 

1035 1044 1053 1062 1072, 10S0 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPIE5CTX S K A X 

1089 1098 1107 1116 1125 113d 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQFREFQVYTLFFSRDEL 

1143 1152 1161 1170 1179 ll.Bfi 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKWQ. VSLTCDVKGFTF5I3 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TCC CAG ACC AAT CCC CAC CCC GAC AAC AAC TAC AAG ACC ACG 



IAVBMESNGQPENrCTYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYSKJLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



D K S RW Q QGNVFS C S VM .H P. 
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j . 1359 1368 1377 1386 139S laoa 

; GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 

» — — — "~ ~ ~ — — ™ — — — — — — — — — — — — - — v _ V S M — _ n < _ 

ALHNHYTQKS.LSLSPflcn 



1413 1422 1431 1440 1449 1458 

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 




GLKIAAFNIQTPGETKMS 

1467 1476 1485 149a 1503 1512 

AAT GCC. ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 



N 


A T 


I. 


V s 


y 


I V 


Q 


I L 


S 


r y 


D 


I A 


CTG 


J- X 
GTC CAG 


GAG 


153 0 
GTC AGA 


GAC 


1539 
AGC CAC 


CTG 


1548 
ACT GCC 


GTG 


1557 
GGG AAG 


CTG 


1S66 
CTG GAC 


T 


V Q 


£ 


V R 


D 


S H 


L 


T A 


V 


G K 


L 


L D 


AAC 


1575 
CTC AAT 


CAG 


1584 
GAC GCA 


CCA 


1593 
GAC ACC 


TAT 


1602 
CAC TAC 


GTG 


1611 
GTC AGT 


GAG 


1620 
CCA CTG 


N 


L N 


Q 


D A 


p 


D T 


Y 


H Y 
** * 


V 


\7 C 

V 0 


Hi 




GGA 


1629 
COG AAC 


AGC 


1638 
TAT AAG 


GAG 


1647 
CGC TAC 


CTG 


1636 
TTC GTG 


TAC 


1565 
AGG CCT 


GAC 


1674 
CAG GTG 


G 


R N 


S 


V K 


E 


R Y 


L 


F V 


y 


K P 


D 


Q v 


TCT 


1683 
GCG GTG 


GAC 


1692 
AGC TAC 


TAC 


1701 
TAC GAT 


GAT 


1710 
GGC TGC 


GAG 


1719 
CCC TGC 


GGG 


1728 
AAC GAC 


S 


A V 


D 


S Y 


Y 


Y D 


D 


G C 


E 


P C 


G 


' N D 


ACC 


1737 
TTC AAC 


CGA 


1746 
GAG CCA 


GCC 


1755 
ATT GTC 


AGG 


1764 
TTC TTC 


TCC 


1773 
CGG TTC 


ACA 


1782 
GAG GTC 



IST R E P A 



1791 1900 1305 iaiB 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 

RE FA IV P L H _ A A P G D A V A E 

1854 1863 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG GAG 



I 


D A 


L 


Y D 


V 


Y L 


D 


v o 


E 


K W 


G 


L E 


GAC 


1899 
GTC ATG 


TTG 


1908 
ATG GGC 


GAC 


1917 
TTC AAT 


GCG 


1926 
GGC TGC 


AGC 


1935 
TAT GTG 


AGA 


1944 
CCC TCC 


D 


V M 


Lt 


M G 


D 


F N 


A 


G C 


S 


Y V 


R 


P S 


CAC 


1953 
TCC TCA 


TGC 


19S2 
ATC CCC 


CTG 


191X 
TOO ACA 


ACC 


1980 
CCC ACC 


TTC 


1989 
CAC TCC 


CTC 


1993 
ATC CCC 


Q 


w s 


s 


I R 


L 


W T 


S 


P T 


F 


Q W 


L 


X P 


GAC 


2007 
AGC GCT 


GAC 


2016 
ACC ACA 


GCT 


2025 
ACA CCC 


ACG 


2034 
CAC TGT 


GCC 


2043 
TAT GAC 


AGG 


2052 
ATC GTG 


D 


S A 


D 


T T. 


A 


T P 


T 


H C 


A 


Y D 


R 


I V 
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; " 2061 2070 2079 2089 2097 2106 

V ' .GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 




VAGMLLRGAVVPDSALPF 

2115 2124 2133 2142 2151 . 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 



N 



2169 217B . 21B7 

CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 

KVPVEV MLK * 



D 



-A- 
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Pi QJUiLt \ 0 



^ PAS37 



LOCUS PAS37.DNA 2226 bp 2196 bp 2196 bp DMA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 37 

DEFINITION Clone 16.4.2 (same as hcdnasel . dna template file) plus NLS 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 

(mhdnase . dna) 

COMMENT The fusion was made using overlapping oligos AS79 and AS30 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 
FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 511 a '683 c 619 g 413 z 

ORIGIN ? 

* 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT G AC AG T C AC T AGAGACACAT CCACAAACAC AGCCTACATG 

3 01 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

4 91 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
bbl AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CflAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 

1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA AGGGAGCGGC GGG CTGAAGA TCGCAGCCTT CAACATCCAG 
14 41 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG. 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC 

17 4 1 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC 
13 01 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC 

18 61 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 

1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 

1961 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 

2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 

2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCHTC 

2161 AGT GACC ACT ATCCAGTGGA GGTGATGCTG A AGGGGGGCG GACCCAAAAA GAAGCGC AAG 
2221 GTTTGA j — 

n L^> NLS 
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i File : PAS37.DNA 

Rajntsye : I - 222 6 

^ ' - Cod on T^blc : universal 



ERIC POTTER CLARKSON 




Mode - Normal 



9 18 21 36 45 54 

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



Q] 077 



W 



H 



63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTC CAG TCT GGO GCA GA<? GTG AAA AAG CCT CCC CCC TCA 



V 



K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



K 



w 



171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W 



K G 



W V 



22S 234 243 252 261 270 

GGA A/£T AAT AAT TCT AGA TAC AAT CAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



N 



V 



279 268 237 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



N 



M 



333 342 351 360 363 375 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



D 



W 



3B7 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



K 



441 450 459 468 477 486 

CTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPS SXSTSfifl'PA a t. 



495 50.4 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V 



V T 



V 



W • N 



5*9 558 567 576 5B5 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG <3CT GTC CTA CAG TCC TCA GGA 



a 



V H 



603 612 621 630 639 S4B 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



657 



666 



675 



684 
- 1 - 



€93 



702 
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; ACC TAC ATC TGC AAC OTG AAT CAC AAG CCC AGC AAC ACC AAG GTG ©AC AAG AAA 
■s^^- ■» »* — ~ ~ — • — — - — 

/ tyicnvnhkpsntkvdkk 

711 720 729 738 747 755 

GTT GAG CCG AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



V 


E 


p 


K 


S 


C 


D 


K T 


H 


T C 


P. 


P C 


P 


A P 


GAA 


CTC 


765 
CTG 


GGG 


GGA 


774 
CCG 


TCA 


783 
GTC TTC 


CTC 


792 
TTC CCC 


CCA 


801 
AAA CCC 


AAG 


810 
GAC ACC 


E 


L 


L 


G 


G 


P 


5 


V F 


L 


F P 


P 


K P 


K 


D T 


CTC 


ATG) 


913 
ATC 


TCC 


CGG 


S2B 
ACC 


CCT 


837 
GAG GTC 


ACA 


aa6 

TGC GTG 


GTG 


855 
GTG GAC 


GTG 


664 
AGC CAC 


L. 


M 


1 


S 




T 


P 


E V 


T 


C V 


V 


V D 


V 


S K 


GAA 


GAC 


873 
CCT 


GAG 


GTC 


882 
AAG 


TTC 


891 
AAC TGG 


TAC 


900 
GTG GAC 


GGC 


909 
GTG GAG 


GTG 


918 
CAT AAT 


E 


D 


P 


E 


V 


K 


F 


N W 


Y 


V D 


G 


V E 


V 


H N 


GCC 


AAG 


927 
ACA 


AAG 


CCG 


936 
CGG 


GAG 


945 

GAG CAG 


TAC 


954 

AAC AGC 


ACG 


963 
TAC CGT 


GTG 


972 
GTC AGC 


A 


K 


T 


K 


H 


R 


B 


£ Q 


Y 




T 


V R 


V 


V s 


CTC 


CTC 


9S1 
ACC 


GTC 


CTG 


590 
CAC 


CAG 


999 
GAC TGG 


CTG 


looa 

AAT GGC 


AAG 


1017 
GAG TAC 


AAG 


1026 
TGC AAG 


V 


L 


T 


V 


I* 


H 


a 


D W 


L 


N G 


K 


E Y 


K 


C K 


GTC 


TCC 


1035 
AAC 


AAA 


GCC 


1044 
CTC 


CCA 


1053 
GCC CCC 


ATC 


1062 
GAG AAA 


ACC 


1071 
ATC TCC 


AAA 


1080 
GCC AAA 


V 


S 


N 


K 


A 


L 


p 


A P 


I- 


"£ K 


T 


I S 


K 


A K 


GGG 


CAG 


1089 
CCC 


CGA 


GAA 


1098 
CCA 


CAG 


1107 
GTG TAC 


ACC 


1116 
CTG CCC 


CCA 


1125 
TCC CGG 


GAT 


113 4 
GAG CTG 


G 


Q 


p 


R 


B 


? 


Q 


V Y 


T 


L> P 


P 


3 R 




3 L, 


ACC 


AAG 


1143 
AAC 


CAG 


GTC 


1152 
AGC 


CTG 


1161 
ACC TGC 


CTG 


1170 
GTC AAA 


GGC 


1179 
TTC TAT 


CCC 


1198 
AGC GAC 


T 


K 


N 


Q 


V 


S 


L 


T C 


Lj 


V K 


G 


F Y 


p 


S D 


ATC 


GCC 


1197 
GTG 


GAG 


TGG 


1206 
GAG 


AGC 


1215 
AAT GGG 


CAG 


1224 
CCG GAG 


AAC 


1233 
AAC TAC 


AAG 


1242 
ACC ACG 


I 


A 


V 




W 


E 


S 


N G 


Q 


P E 


N 


N Y 


K 


T T 


CCT 


CCC 


1251 
GTG 


CTG 


GAC 


1260 
: TCC 


GAC 


1269 

GGC TCC 


TTC 


1278 

TTC CTC 


TAC 


1287 
AGC AAG 


CTC 


1296 

ACC GTG 




K 



1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKSRWQQ'GNVFSCSVMHE 



-2* 



1 
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12)079 




GCT 


■i "5 c n 

1353 
CTG CAC 


AAC 


1369 
CAC TAC 


ACC 


1377 
CAC AAC 


ACC 


4 *^ n 

1366 

CTC' TCC 


CTC 


1395 

TCT CCC 


GGT 


1404 

AAA CCC 


A 


L H 


N 


H Y 


T 


O K 


s 


h S 




S P 


G 


K G 


AGC 


1413 
GGC GGG 


CTG 


1422 
AAG ATC 


GCA 


1431 
GCC TTC 


AAC 


1440 
ATC CAG 


ACA 


1449 
TTT GGG 


GAG 


ia5B 

ACC AAG 


S 


G G 


L 


K I 


A 


A V 


N 


X Q 


T 


P G 


£ 


T K 


ATG 


i.4 bV 
TCC AAT 


GCC 


ACC CTC 


GTC 


AGC TAC 


ATT 


1 a n a 

GTG CAG 


ATC 


1503 
CTG AGC 


CGC 


1512 
TAC GAC 


M 


S N 


A 


T L 


V 


S Y 


I 


V Q 


I 


L S 


R 


Y D 


ATG 


1521 
CCC CTC 


CTC 


153 0 
CAC GAC 


GTC 


1539 
AG A GAC 


AGC 


At r 1 M #^ 

1548 
CAC CTG 


ACT 


1557 
GCC GTC? 




1566 
AAC CTC 


I 


A L, 


V 


Q E 


V 


R D 


S 


H h 


T 


A V 


G 


K L 



1575 15S4 1593 1602 1611 . 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



L 


D N 


1* 


N Q 


D 


A P 


I> 


T Y 


H 


Y V - 


V 


S £ 


CCA 


1629 
CTG GGA 


CGG 


1638 
AAC AGC 


TAT 


1647 
AAG GAG 


CGC 


1656 
TAC CTG 


TTC 


1665 
GTG TAC 


AGG 


1574 
CCT GAC 


P 


L G 


R 


N S 


Y 


K . E 


R 


Y L 


F 


V Y 


R 


P D 


CAC 


1683 
GTG TCT 


GCG 


1692 
GTG GAC 


AGC 


1701 
TAC TAC 


TAC 


1710 

CAT CAT 


CCC 


1719 
TCC CAG 


CCC 


1728 
TGC GGG 


Q 


v S 


A 


V D 


S 


Y Y 


Y 


D D 


G 


C E 


P 


C G 


AAC 


1737 
GAC ACC 


TTC 


1746 
AAC CGA 


GAG 


1755 
CCA GCC 


ATT 


1764 
GTC AGG 


TTC 


1773 
TTC TCC 


CGG 


1782 
TTC ACA 


N 


D T 


F 


N R 


E 


P A 


I 




F 


F S 


R 


? T 


GAG 


X791 
GTC AGG 


GAG 


1B00 
TTT GCC 


ATT 


1809 
GTT CCC 


CTG 


iaia 

CAT GCG 


GCC 


1B27 
CCG GGG 


GAC 


1836 
GCA OTA 


E 


V R 


£ 


F A 


I 


V P 


L 


H A 


A 


P G 


D 


A V 


GCC 


1845 
GAG ATC 


GAC 


ia54 

GCT CTC 


TAT 


1863 
GAC GTC 


TAC 


1872 
CTG GAT 


GTC 


1831 
CAA GAG 


AAA 


1890 
TGG GGC 


A 


B I 


D 


A L 


Y 


D V 


Y 


L D 


V 


Q E 


K 


W G 



1899 190S 1917 1926 1935 194-4 

TTC CAC CAC CTC ATC TTC ATC CGC GAC TTC AAT CCC CCC TCC ACC TAT CTG AGA 



E 



D V 



M 



N 



1353 1962 1971 1980 1989 1995 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 



W 



W 



T 



2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 



T 



T 

3- 



H 



A 



7k 
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- . fc 2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 




IVVAGMLLRG AV VPDSAL 



2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 



P F N F Q A A Y G L S V Q L. <.A , ' Q A Z 

2169 217B 2187 2196 2205 2214 

ACT GAC CAC TAT CCA OT© GAG GTG ATG GTG AAG GGG GGC GGA CCC AAA AAG AAG 



£ D H Y P V K VM L, IC G G G ' P K K K 
2223 

. CGC AAG GTT TGA 3' 
R K V 



. i 



- a - 
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g)081 




( 



(fy pAS38 



LOCUS 

DEFINITION 
DEFINITION 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
COMMENT 



(mhdnase . dna) 



PAS 3 8 . DNA 2223 bp • 2193 bp DNA ld-AUG-1998 

HUMANISED HMFG1 heavy' chain fused to human DNAse construct 38 
Clone 17.12.1 with silent K to K mutation {1398 A > G)+NLS 

VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 
IMMUNOL. (1993) :78, 364-370 

Human DNAse sequence is modified as a result of oligo assembly 



COMMENT 
FEATURES 
FEATURES 
FEATURES 

SITES 
BASE COUNT 
ORIGIN 



The fusion was made using overlapping oligos AS81 and AS82 

AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

Residue 963 is G > T leading to silent mutation in all clones 

In 17,12,1 residue 1398 is A > G (silent K to K mutation) 
Note 

510 a 683 c 618 g 412 t 

9 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
51 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
131 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

2 41 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AG C C T AC AT G 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

4 21 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AG G AC T C T AC 
601 TCCCTCAGCA CCCTCCTGAC. CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
561 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
7 21 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAftC TCCTGGGGGG ACCGTCAGTC 
7 81 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 CGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAftG 

1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
10 81 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CC&.G CGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCC.GT GCTGGACTCC 
12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GAT GCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGAA gGG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CAT C C AG AC A 
14 41 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621- CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
1301 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
18 61 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
,1981 ACCTTCCAGT GGCTGATCCC .CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TAT GACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGA AG GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT 
2221 TGA ' 
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File = PAS38.DNA 

Range : 1 - 2223 

Co'dLon Tatile : universal 



Modo t Normal 



9 18 27 36 . 45 54 

5' ATG GGA TGG ACC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



M 



W 



V 



H 



63 72 81 90 99 108 

TCC CAG 15 TO CAG CTG GTG CAG TCT GGG GCA GAG GTC AAA AAG CCT GGG GCC TCA 



082 



v 



K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGG TAC ACC TTC AGT GCC TAG TGG ATA GAC 



K 



W 



171 180 189 198 207 216 

TCC GTG CGC CAC CCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



£ 



E 



225 234 243 2S2 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAC CGC CGA GTG ACA GTC ACT 



N N 



N S 



K 



K 



279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



D 



N 



M 



333 342 351 360 359 370 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



V 



Y 



D 



W 



387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTC CTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W 



V 



V 



441 450 4S9 468 477 496 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GCC ACA CCC CCC CTG 



V 



K 



495 304 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



K 



V 



V 



W N 



549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



603 612 621 630 639 648 

CTC TAC TCC CTC ACC AGC CTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



657 



666 



£75 



684 
- 7 - 



693 



702 
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! • ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG CAC AAG 

'T'yicnvnhkpsntkvdkk 

711 72 0 72 9 73 8 747 7 56 

GTT GAG CCC AAA TCT TOT GAC AAA ACT CAC AC A TGC CCA CCG TGC CCA GCA CCT 




VEPKSCDKTHTCPPCFAP 

7S5 774 "733 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGGPSVFLFPPKPKDT 

819 828 337 846 853 664 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMXSRTPEVTCVVVDVS H 

873 882 891 900 909 918 

CAA CAC CCT GAG GTC AAC TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

BDPEVKF NWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

T 

aktkpreeqyn styr vvs 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLKCKEYKC K 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAP I EKTISKAK 

108S 1098 . 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREP QVYTLPPSKDSI, 

1143 1132 1161 1170 1179 1183 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TK NQVSI* TC LVKGPY P S D 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

IAVEWESNGQ .PENNY .KTT 

1251 1260 1259 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PFVLDSDGSFFLYSKL.TV 

1305 1314 1323 1332 1341 13S0 

CAC AAG AGC ACC TCC CAC CAG GGG AAC CTC TTC TCA TCC TCC GTG ATG CAT GAG 

DKSRWQQGWVPSCSVMHE 



-2- 
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CCT 


1359 
CTG CAC 


AAC 


1368 
CAC TAG 


ACG 


137 7 
CAG AAG 


AGC 


1386 

CTC TCC 


CTG 


TCT CCG 


AAG 


1404 
GGG AGC 


A 


L K 


N 


H Y 


T 


Q K 


S 


Li S 


L 


S P 


K 


G S 


GGC 


1413 
GGG CTG 


AAC 


1422 
ATC GCA 


GCC 


143 1 
TTC AAC 


ATC 


1440 
CAG ACA 


TTT 


JL4a9 

GGG GAG 


ACC 


1458 
AAG ATG 


G - 


G L 


K 


I A 


A 


F N 


I 


Q T 


* 

r 


G £ 


T 


K M 


TCC 


1467 
AAT GCC 


ACC 


1476 
CTC GTC 


AGC 


1485 
TAG ATT 


GTG 


1494 
CAG ATC 


CTG 


1S03 
ACC CGC 


TAC 


1512 
GAC ATC 



N 



A 



V 



V 



X. 



1521 1530 1539 1548 1557 15C6 

GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 



®0$4 



T 



V 



1S75 1584. 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 



N 



IT 



H 



V V 



1629 1638 1647 1656 1665 1674 

CTG GGA CGG AAC AGC TAT AAC CAC CCC TAC CTC TTC GTG TAC AGG CCT GAC CAG 



N 



Q 



1683 1692 1701 1710 1719 1728 

GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 



N 



1737 1746 1755 1764 1773 1782 

GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG 



T 



N 



1791 1800 180O 1818 1S27 ' 1836 

GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC 



H 



1845 1854 1363 1872 1881 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 



D 



D V 



D V 



K W 



1899 1906 1917 1926 1935 1944 

GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TOC AGC TAT GTG AGA CCC 



19S3 1962 1971 1980 1989 1999 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 



W 



w 



2007 2016 2025 2034 2043 2052 

CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 



H C 

3- 
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I " 

■ ■ 2061 2070 2079 -3088 . 2057 2106 

/ GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC 




VVAGMLLRGAVVPDSALP 

2115 2124. 2133 2142 2151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 

FNFQAAVGLS DQLAQAIS 

2160 217B 2187 2196 2205 2214 

GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG CGC 

DHYPVB-VMLiKGGG PKKKR 

2223 
AAG GTT TGA 3' 



4 - 



Received 03-Apr-DO 15;24 



From-01 159 552201 



To-^THE PATENT OFFICE Page 85 



03/04 00 MON 14:48 FAX 01159 552201 



ERIC POTTER CLARKSON 




U) PAS39 



LOCUS PAS 39. DN A 2220 bp 2190 bp DMA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse - construct 3 9 
DEFINITION Clone 18.24.1 with residue 1392 T > C +NLS 

REFERENCE 

AUTHORS VERHOEYEN £T AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL, (1993) =78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase - dna) 

COMMENT The fusion was made using overlapping oligos AS83 and AS84 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES Residue 1392 T > C silent S to S mutation 

SITES Note 
BASE COUNT 508 a 684 C 617 g 411 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACT GGAT AG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

4 21 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
6 61 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 

8 41 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AG C AG T AC AA C AG C AC G T AC 

9 61 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 

1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
12 01 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AG AG C AG G T G GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCCG GGGAG CGGCGGG CTG AAGAT C GC AG CCTTCAACAT CCAGACATTT 
14 41 GGGGAGACCA AGATGTCCAA TGCCACCCTC GTCAGCTACA TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 GTGGACAGCT ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA 
•17 41 GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 
18 01 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
18 61 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 G AC AG GAT CG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGAAGGGG GGCGGACCCA AAAAGAAGCG CAAGGTTTGA 

// 
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Moda : Normal 




9 18 27 36 45 5d 

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT AC A GGT GTC CAC 



(2)037 



M 



W 



V 



63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT CCC CCA GAG GTG AAA AAGT CCT GGG GCC TCA 



V 



V 



K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



V K 



V 



w 



171 180 189 19B 207 215 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W 



to 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT CAC AAG TTC AAG GGC CCA CTG A.CA GTC ACT 



N N 



R 



N E 



K 



K 



V 



279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



N 



333 342 351 360 363 373 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



D 



V 



R 



F 



W 



3B7 396 405 &X& 423 432 

TGG GGC CAA GCG ACT CTC CTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



w 


G 


Q 


G 


T 


L 


V 


T 


V 


S 


s 


A 


s 


T 


K 


G 


P 


s 


GTC 


TTC 


441 
CCC 


CTG 


GCA 


450 
CCC 


TCC 


TCC 


459 
AAG 


AGC 


ACC 


468 
TCT 


GGG 


GGC 


477 
ACA 


GCG 


GCC 


486 
CTG 


V 


F 


P 


L 


A 


P 


s 


S 


K 


S 


T 


S 


G 


G 


T 


A 


A 


L 


GGC 


TGC 


CTG 


GTC 


AAG 


504 
GAC 


TAC 


TTC 


513 
CCC 


GAA 


CCG 


522 
GTG 


ACG 


GTG 


531 
TCG 


TGG 


AAC 


540 
TCA 


G 


C 


L> 


V 


K 


V 


Y 


P 


P 


E 


P 


V 


T • 


V 




W 


M 


£ 


GGC 


GCC 


549 
CTG 


ACC 


AGC 


558 
GGC 


GTG 


CAC 


567 
ACC 


TTC 


CCG 


576 
GCT 


GTC 


CTA 


585 
CAG 


TCC 


TCA 


594 
GGA 


G 


A 


L 


T 


S 


G 


V 


H 


T 


? 


P 


A 


V 


L 


Q 


S 


S 


G 


CTC 


TAC 


603 
TCC 


CTC 


AGC 


612 
AGC 


GTG 


GTG 


621 
ACC 


GTG 


CCC 


630 
TCC 


AGC 


AGC 


639 
TTG 


GGC 


ACC 


648 
CAG 




Y 


S 

657 


L 


S 


S 
666 


V 


V 


T 
675 


V 


p 


s 
es<a 


s 


s 


L 
€93 


G 


T 


Q 
702 



1 - 
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i " ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AOC AAC ACC AAQ GTG CAC AAC AAA 

* — M ^ W V« ^ — — *— v V ^ V «l — —~ — ^» ^ ^ ^ — — — — — — ™ — — — * ™ ■ ~ ™* ~* ^ ~ "~ ~ 

TYICNVNHKPSNTKVDKK 



711 720 729 733 747 756 

GTT GAG CCC AAA TCT TCT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 




vep'kscdkthtcppcpap 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

E L LGG P S V F I* F P P K PK DX 

819 828 837 846 8SS 964 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTC ACC CAC 

I>KISRTPEVTCVVVDV5 H 

873 882 891 900 909 918 

GAA CAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTY RVVS 

981 990 999 1003 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAC TCC AAG 

VL TV L H Q DW L N C K • E YKC K 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



V SNKALFAPIEKTI SKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



1143 1152 1161 r 1170 1179 HS8 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGFYPSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



I AVEWESNGQ PENNY KTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



P y V L» d 3D<?.*"«T"rLiYS K I* T V 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGC TGG CAG CAC GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DK5RWQOG NVFSCSVMHE 



-2 
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\ ' 1359 1368 1377 1366 1395 14.04 

•GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCC GGG AGC CCC 




lhnhytqkslslspgsg 



1413 1422 1431 1440 1449 1458 

GGG CTC AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 



GLKIAAFNIQTFGETKMS 

1467 1476 1485 1494 1503 1512 

AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 



NATLVS YIVQI LSRY DIA 

1521 1530 1D39 1548 1557 1566 

CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG CCC AAC CTG CTG GAC 



LVQEVRDSHL-TAVGKkkD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 



NLNQDAP DTYHYVVS E PL 

1629 1538 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 



GRNSYKERYLFVYRP. DQV 

1683 1692 1701 1710 1719 1728 

TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT CCC TGC GAG CCC TGC GCC AAC GAC 



SAVDSYYYDDGCEPCGND 

1737 1746 1755 1764 1773 17B2 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 



TFNREPA IVRFF.SRFTEV 

1791 1800 1809 IBIS 1B27 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 



REFAIVFLHAAFGDAVAE 

1845 1854 1863 1872 1881 1890 

ATC CAC CCT CTC TAT GAC GTC TAC CTC CAT GTC CAA GAG AAA TGG GGC TTG GAG 



IDALYDVYL DVQEKWGLE 

1899 190B 1917 1926 1935 19<S4 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 



DVMLMGDFNA.GCSYVRPS 

1963 1962 1971 1350 1389 1998 

CAG TGG TCA TCC ATC CGC CTC TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC CCC 



QWSSIRLWTSPTPQWLIP 

2007 2016 2025 2034 20d3 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 



DSADTTATPTHCAYDRIV 

-3- 
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[gj090 




2061 2070 2079 2089 2097 2106 

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 



M 



V 



2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 

NFQAAYGLSDQLA QAISD 

21S9 2173 2187 219S 2205 2214 

CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AM CGC AAG 



H 



M 



K 



K 



K K 



K 



OTT TGA 3' 



V 



- 4 - 



4 > 
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(A) PASIOI 



LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 



PAS101.DNA 1548 bp 



mRNA 



PRI 



06-MAR-1995 



Humanised HMFG1 Fab 1 2 fused to human DN^ss I (pASlOl) 



DNase I. 

DNase I sequence is from asaerttfDlea oligos trhus modified c/£ 



MHDNASEl.dna) 

ORGANISM Homo sapiens 



AUTHORS 
TITLE 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 

1 
61 
121 
181 
241 
301 
361 

421 
481 
541 
601 
661 
721 
781 
341 

901 
961 
1021 
1031 
1141 
1201 
1261 
1321 
1381 

1441 
1501 



EuKaryotae; mltochonarial eukaryotes; Metazoa; Chgrdata; 
Vertefcrata; Eutheria; Primates; Catarrhini,- Hominidae; Homo . 
ShaK,S., Capon, D.J-, Hellroiss, R. , Marstera,S.A, and Baker,C.L. 
Recombinant human DNase I reduces the viscosity o£ cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci . U.S.A. 97 (23), 9199-9192 (1990) 
91067672 

343 a 467 c 430 g 308 t 



ATGGGATGGA 
GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCAACATCC 
GTGCAGATCC 
ACTGCCGTGG 
GTGGTCAGTG 
CCTGACCAGG 
AACGACACCT 
AGGGAGTTTG 
GCTCTCTATG 
ATGGGCGACT 
CTGTGGACAA 
CCCACGCACT 
GTTCCCGACT 
GCCCAAGCCA 



GCTGTATCAT 
TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 

ACACATGCCC 
AGACATTTGG 
TGAGCCGCTA 
GGAAGCTGCT 
AGCCACTGGG 
TGTCTGCGGT 
TCAACCGAGA 
CCATTGTTCC 
ACGTCTACCT 
TCAAT GCGGG 
GCCCCACCTT 
GTGCCTATGA 
CGGCTCTTCC 
TCAGTGACCA 



CCTCTTCTTG 
GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCGCTCC 
CAACACCAAG 

ACCGTGCCCA 
GGAGACCAAG 
CGACATCGCC 

GGACAACCTC 
ACGGAACAGC 
GGACAGCTAC 
GCCAGCCATT 
CCTGCATGCG 
GGATGTCCAA 
CTGCAGCTAT 
CCAGTGGCTG 
CAGGATCGTG 
CTTTAACTTC 
CT AT G C AG T G 



GTAGCAACAG 
AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 

AGAGACACAT 

GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 

G C AC C T GAAG 
ATGTCCAATG 
CTGGTCCAGG 
AATCAGGACG 
TATAAGGAGC 
TACTACGATG 
5TCAGGTTCT 
GCCCCGGGGG 
GAGAAAT GGG 

GTGAGACCCT 
ATCCCCGACA 
GTTGCAGGGA 
CAGGCTGCCT 
GAGGTGATGC 



CTACAGGTGT 
GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 

ACTGTGCAAG 

CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCGAGAC 
AAGTTGAGCC 

GCGGGCTGAA 



CCACCCTCGT 
AGG T C AG AG A 

CACCAGACAC 

GCTACCTGTT 
ATGGCTGCGA 
TCTCCCGGTT 
ACGCAGTAGC 
GCTTGGAGGA 
CCCAGTGGTC 
G CG C T G AC AC 
TGCTGCTCCG 
ATGGCCTGAG 
TGAAGTGA 



CCACTCCCAG 
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AG C C T AC AT G 
ATCCTACGAC 

AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GATCGCAGCC 
CAGCTACATT 
•CAGCCACCTG 

CTATCACTAC 
CGTGTACAGG 
GCCCTGCGGG 
C AC AG AG G T C 
CGAGATCGAC 
CGTCATGTTG 
ATCCATCCGC 
C AC AG C T AC A 
AGGGGCCGTT 

TGACCAACTG 
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(2)092 



9 File : PAS101.DNA 
Range : 1 - 




Codon Table 



1548 

Universal 



Mode ; Normal 



ft Cuff 13. ft) 



9 is 21 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



M 



V H 



63 72 81 90 99 ' 1°S 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



V. 



V 



E 



K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



K 



W 



171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



O 



W V 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



Itf N 



N 



V 



V 



279 238 297 306 315 32/1 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG ^CC TCT CAG 



R 



333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



V 



R 



W 



387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W 



V 



V 



K 



441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



V 



K 



495 504 513 522 531 540 

CCC TCC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V 



D 



V 



W N 



S49 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA .CAC TCC TCA GGA 

GALTSG.VHTF F A V L Q S S G 

- 1 - 
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i 

603 612 621 630 639 648 




. CTC TAG TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPS3SLGTQ 

657 66$ 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

CTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



EGGfcKlAAFNIQTFGET K 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MSNATLVSYIVQILSRYD 

873 882 991 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 



IAL.VQEVR D S HLTAVGKL 

927 936 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQ'DA PDTYHYVV S E 

981 990 999 1008 1017 1026 

CCA CTC CCA CCC AAC AGC TAT AAG GAC CGC TAC CTG TTC GTG TAC AGG. CCT GAC 



DlrGRNSYKERYI-FVYRPD 

103S 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



QVSA VP 5 Y Y Y DDGC E ? C G 

10B9 1098 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 



NDTFNREPAIVRFFSRFT 

1143 1152 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVRT5FAIVPLHAAPGDAV 

1197 1206 1215 122fl 1233 1242 
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GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG CCC 
^^TT^ar. vDVYLDVQEKWG 



1251 1260 1269 127B 1287 1296 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

LEDVMLMGDFNAGC SYVR 

1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 



w 



WTSPTFQW 



1359 1369 1377 1386 1395 1404 

ATC CCC CAC AGC GCT CAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 



IPDSADTTA TPTH.C AYDR 

1413 1422 1431 1440 1£&9 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCC CCT CTT 



KLLRGAVVPD SA 



1467 • 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 



N 



1521 1530 1539 1548 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

SDHYPVEVML K* 
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@095 



(fl PAS102 



LOCUS 



PAS102.DNA 1566 bp 



mRNA 



PRI 



06-MAR-1995 



DEFINITION Humanised HMFG1 Fab ' 2 fused to human DKTase I (pAS102) 

ACCESSION ^ 

NID 

KEYWORDS 
SOURCE 



DNasa I _ 

DNase I sequence is from assembled olig/os (thus modified c/f 



MHDNASE1 . dna ) t£C«. f^ff^C IS) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
5hak,S., Capon,D.J., Hellmiss,R., Marsters , S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

345 a 469 c 440 g 312 t 



AUTHORS 
TITLE 



JOURNAL 
MEDLINE 

BASE COUNT 
ORIGIN 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACT GGAT AG AGTGGGTGCG CCAGGCTCCA 
1B1 GGAAAGGGCC TCGAGTGGGT CG GAG AG ATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGA5T GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

6 61 AACG T GAAT C ACAAGCCCAG GAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

7 21 GACAAAACTC ACACATGCTG TGTCGACTGC CGACCGTCCC CAGCACCTGA AGGGACCGGC 
781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 G T CAG AG AC A GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 

1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAG AG G T CAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
12 61 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
14 41 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 



Received 03-Apr-QQ 15:24 



From-01159 5522D1 



Tq-THE PATENT OFFICE Page 95 



59^ 



03/04 '00 MON 14:52 FAX 01159 552201 ERIC POTTER CLARK SON B096 



» File : PA5102 . OTA 

Range : 1 - 1566 Mode : Normal 

Codon Table : Universal 




9 is 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA AC A GCT ACA GGT GTC CAC 



M G W 



FLVATATGVH 



63 11 81 90 99 " 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

S Q * V Q L V Q ' S G A E V A K ' P G A S 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA. CAG 



V K V 



w 



171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT tTA CCT 



WVRQA-PGKG&EWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTBITAYME. LSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT CCA ACA TCC TAC GAC TTT GCC TGC TTT CCT TAC 

DTAV YYCARS YDFAWFAY 

3S7 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WG QGT L.VTV S 5 A S T K G P S 

441 450 459 458 177 436 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTS GGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 557 576 535 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTF PAVLUSSG 

- 1 - 



5? 
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®097 




603 612 621 630 539 648 

CTC TAC TCC CTC AGC ACC CTG GTG ACC CTG CCC TCC AGC AGC TTG GGC ACC CAG 



657 ' G66 S75 684 £93 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC MC CTG GAC AAC AAA 

TVICNVNHKPSHTKVD.KK 

■JH 720 729 738 747 756 

GTT GAG CCC AAA TCT TOT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



K 



U T C C V 



7G5 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG 

CPAPEGSGGLKIAAFNIQ 

819 528 837 846 855 S64 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 

TFG E TKMS WATX.V S Y 1 V Q 

673 882 891 900 ^ 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG CTC CAG GAG GTC ACA CAC AGC CAC CTG 

XLSRYDXALVQEVRDSKL 

327 936 345 954 963 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 

TAVGKLLDNLlsrQDAPDTY 

981 990 999 1003 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 

HYVVS EPLGRNSVKERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 

FVTRPDQVSAVD3YYYDD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 

GCEP CCNDTFNREPAIVR 

1143 1152 1161 1170 1179 1183 

TTC TTC TCC CGG TTC ACA CAC CTC AGG GAG TTT CCC ATT GTT CCC CTG CAT GCG 

FFSR FTEVREFA1VPL HA 
1137 1206 1215 1224 1233 1242 

-2- 
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x GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GA>T 



@]098 




APGDA VAEIDAL-'YDVYLD 

1251 1260 1269 127S 1287 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC aat GCG 

vqekwgledvmlmgdfna 

1305 1314 1323 1332 1341 1350 

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC 

GCSYVRPSQWSS I R L, W T S 

1359 1368 1377 1386 1395 1404 

CCC ACC TTC CAG TGG CTC ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG 

PTFQWL I pDSADTTATP T 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTC CTC CGA GGG GCC GTT 

HCAYDRlVVAGMLiLKGAV 

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VPDSALPFNFQAAYGLSD 

1521 1530 1539 1548 > 1557 1565 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

GL-AOAISDHYPVEVMLK* 



J 
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|0 pAS103 



06-MAR-1995 



LOCUS PAS103.DNA 1560 bp mRNA PRI 

DEFINITION Humanised HJMFG1 Fab 1 2 fused to human DNase I (pAS103) 

ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata? 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S., Capon, D.J,, Hellmiss, R. , Marsters, S. A. and Baker , C. L . 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl, Acad. Sci. U.S.A., 87 (23) , 9183-9192 (1990) 
91067672 

344 a 468 c 436 g 312 t 



AUTHORS 
TITLE 



JOURNAL 
MEDLINE 

BASE COUNT 
ORIGIN 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG G TAG CAACAG CTACAGGTGT CCACTCCCAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC T AC T G GAT AG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GAC AG T C ACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

3 SI TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC ACCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

491 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTCCTCAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

661 AACGTGAATC ACAAGCCCAG CAACACCRAG GTGGACAaCA &ACTTCAGCC CAAATCTTGT 

721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGCCGGG CTG 

781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 

841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 

901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 

961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 

1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 

1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 

1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 

1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 

1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 

1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 

1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GAC AG GAT CG TGGTTGCAGG GATGCTGCTC 

1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 

1501 AGTGACCAAC TGGCCCAAGC CAT CAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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' File : FAS103 _DNA 

Range : 1 - 15 60 Mode : Normal 

Codon Table : Universal 



9 18 21 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



®100 



M 



w 



V 



V H 



63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



v 



V Q 



V K K 



117 126 135 144 1S3 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



K V 



K 



W 



171 180 189 195 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



K 



W V 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



N 



K 



K 



V 



V 



279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



D 



S 



KT 



M 



333 342 351 360 369 373 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT CCC TGG TTT GCT TAC 



V 



D 



W 



367 396 405 414 423 432 

TGG GGC CAA GGG ACT CXG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



V 



V 



441 450 459 468 477 4BS 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPL.APSSK5TSGGTAAL 



495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V 



V 



V 



W N 



549 S58 567 576 585 594 

GGC GCC CTG ACC ACC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



H 



P A V L 

- ^ - 



9 
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603 612 621 S30 633 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPS SSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG _ AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

CTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 

VEPKSCDKTHTCCVEC P P 

765 774 763 792 . 301 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 

cpapegglkiaa 'fniqtf 

819 828 837 846 855 B64 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 

GETKMSNATLiVSYIVQXL 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 

SRYDIALVQEVRDSHLTA 

927 936 945 954 963 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 

VGKLLDNLNQDAPDTYHY 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 



VVSEPLGRNSYKERYLPV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT CCC CTG GAC AGC TAC TAC TAC CAT CAT CCC TCC 

YRPDQVSAVDSVYYDDG C 

10B9 1096 ' 1107 1116 1125 1134 

GAG CCC TGC GGG AAC 'GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 

EPCGNDTFNREPAIVRFF 

1143 ^ 1152 1161 1170 1179 1188 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG 



RFTEVREFAIVPLHAAP 

1197 1206 1215 1224 1233 1242 

- 2- 
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* GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA 

GDAVAEIDALYDVYLDVQ 



1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

E KWG L E DVMLMG D F N A G C 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVRPSQWSS'I RLWTS P T 

13S9 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWLIPDS.ADTTATPTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CCA CCG GCC GTT GTT CCC 

A Y D ft I V V A G M I. L R G A V V P 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

D- SAL PFNF'Q AAYGL SDQL 

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

AQAISDHYPVSVMLK* 



- 3 - 
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(g) P AS104 



LOCUS PAS104.DNA 1560 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFG1 Fab r 2 fused to human DNase I (pAS104) 

Position 924 G to A by ggg to gag 

Linker GR instead of GG (position 777) 

ACCESSION 

NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shak,S-, Capon, D. J. , Hellmiss , R. , Marsters, S . A. and Baker, C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis 

sputum 

JOURNAL Proc. Natl. Acad- Sci. U,S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067 572 
BASE COUNT 34 6 a 4 68 c 4 34 g 312 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT G AC AG T C AC T AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT" ACTGTGCAAG ATCCTACGAC 
3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
491 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGCCGT GCACJyCCTTC CCGGCTGHCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTCC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT ' 
721 GACAAAACTC ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCSGG CTG 
731 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC T AC G AC AT CG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GG§GAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 

9 61 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 

1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 

1081 GAGCCCTGCG G GAACGAC AC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 

1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 

1201 GCCGAGAT CG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 

1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 

1321 TCATCCATCC GCC TGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 

1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 

14 41 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 

1501 AGTGACCAAC TGGCCCAAGC CAT C AG T G AC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 

// 



59 
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Mode t Normal 



9 18 27 36 45 54 

5' ATC GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



M 



W 



V 



V H 



63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG' GCC TCA 



V 



V K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



V K V 



K 



W 



171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



K 



W V 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



R 



K 



K 



V 



V 



279 288 297 306 315 324 

ACA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



M 



333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC CAC TTT GCC TGG TTT GCT TAC 



D 



V 



387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC JLCA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



W 



V 



K 



441 450 459 468 477 485 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



K 



495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



G 



V K 



D 



V 



W N 



549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



A 



V 



P A V 
- 1 - 



Q 
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603 612 621 63 0 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

.657 666 675 684 ■ 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 

VEPKSC-DKTH .TCCVECPP 

■765 lid 783 792 801 810 

TGC CCA GCA CCT GAA GGC AGG CTG AAG ATC GCA GCC .TTC AAC ATC CAG ACA TTT 

CPA PBGRLKIAAPNI QTP 

819 828 837 846 S55 864 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 

GETKM5NATLV5YIVQI L 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 

S RYDIALVQEVRDSHLTA 

927 936 945 954 963 972 

GTG GAG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 



V 


E K 


L 


L D 


N 


L N 


Q 


D A 


P 


D T 


Y 


K Y 


GTG 


981 
GTC AGT 


GAG 


990 
CCA CTG 


GGA 


999 
CGG AAC 


AGC 


1008 
TAT AAG 


GAG 


1017 
CGC TAC 


CTG 


1026 
TTC GTG 


V 


V S 


E 


P L 


G 


R N 


S 


Y K 


E 


R Y 


L 


F V 


TAC 


1035 
AGG CCT 


GAC 


1044 
CAG CTC 


TCT 


1053 
CCC CTC 


GAC 


1062 
AGC TAC 


TAC 


1071 
TAC GAT 


GAT 


1080 
GGC TGC 


Y 


R P 


D 


Q v 


s 


A V 


D 


s y 


Y 


Y D 


D 


C C 


GAG 


1069 
CCC TGC 


GGG 


1099 
AAC GAC 


ACC 


1107 
TTC AAC 


CGA 


1116 
GAG CCA 


GCC 


1125 
ATT GTC 


AGG 


1134 
TTC TTC 


E 


P c 


G 


N JO 


T 


F N 


R 


E P 


A 


I V 




r F 


TCC 


1143 

CGG TTC 


ACA 


1152 
GAG GTC 


AGG 


1161 
GAG TTT 


GCC 


1170 
ATT GTT 


CCC 


1179 
CTG CAT 


GCG 


1188 
GCC CCG 


s 


R F 
1197 


T 


E V 
1206 


R 


E F 
1215 


A 


I V 
1224 


p 


L H 
1233 


A 


A P 
1242 



-a 



iff 
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GGG 


GAC GCA 


GTA 


GCC GAG 


ATC 


GAC GCT 


CTC 


TAT GAC 


GTC 


TAC CTG 


GAT 


GTC CAA 


G 


D A 


V 


.A E 


I 


D A 


L 


Y D 


V 


Y L 


D 


V Q 




1251 




1260 




1269 




1278 




12S7 




1296 


GAG 


AAA TGG 


GGC 


TTG GAG 


GAC 


GTC ATG 


TTG 


ATG GGC 


GAC 


TTC AAT 


GCG 


GGC TGC 


























E 


K W 


G 


L E 


D 


V M 


L 


M G 


D 


F N 


A 


G C 




1305 




1314 












J. J fiX 




1 J JU 


AGC 


TAT GTG 


AGA 


CCC TCC 


CAG 


TGG TCA 


TCC 


ATC CGC 


CTG 


TGG ACA 


AGC 


CCC ACC 


s 


Y V 


R 


P S 


Q 


W S 


S 


I R 


L 


W T 


S 


P T 




1353 




1366 




137"? 




1386 




i o t\ ~ 
139o 




1404 


TTC 


CAC TCG 


CTG 


ATC CCC 


GAC 


AGC GCT 


GAC 


ACC ACA 


GCT 


ACA CCC 


ACG 


CAC TGT 


F 


Q W 


L 


I P 


D 


S A 


D 


T T 


A 


T P 


T 


H C 




1413 




1422 




1431 




1440 




1449 




1458 


GCC 


TAT GAC 


AGG 


ATC GTG 


GTT 


GCA GGG 


ATG 


CTG CTC 


CGA 


GGG GCC 


GTT 


GTT CCC 


























A 


Y D 


R 


X V 


V 


A G 


M 


L L 


R 


G A 


V 


V P 



1467 1476 1485 1494 1503 1312 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 



DSALPFNFQAAYGLSDQ 

1521 -1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

AOAISDHYPVEVMLK + 



-3- 
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(A) PAS105 



LOCUS PAS 105. DN A 1578 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFGl Fab ' 2 fused to human DNase I with SV40 NLS<pAS105) 
ACCESSION 
NID 

KEYWORDS DNase I . 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASE1 . dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Homlnidae; Homo. 
AUTHORS Shak,S. , Capon, D. J., Hellmiss , R . , Marsters , S . A. and Baker,C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci . U.S.A. 87 (23), 3188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 353 a 473 c 442 g 310 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

2 41 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

3 01 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
■4 21 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
S61 AACGTGAATC ACAAGCCCAG CAACl^CCAAG GTGGAOAAGA AAGTTGAGCC CAAATCTTGT 
7 21 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGA& G GCGGG CTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
3 41 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
9 01 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 
9 51 GTGGTCAGTG AGCCACTGGG AC GGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 

1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201, GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGFCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 

14 41 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGGGGGG CGGACCCAAA 

15 61 AAGAAGCGCA AGGTT TGA / 
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Codon Table : Universal 
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Mode : Normal 



B 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



BI108 



M 



W 



V 



H 



63 72 81 90' 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



V 



V O 



V K K 



117 126 135 144 153 162 

GTG AAG GTG TCC TCC AAG GCT TCT GCC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



K 



K A 



w 



171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



K 



W 



225 234 243 252 261 ,270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



N N 



K 



K 



V 



V 



279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



•s 



N 



M 



333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



V 



W 



387 396 405 414 4.23 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA CCC TCC ACC AAG GGC CCA TCG 



T 



T V 



A 



441 450 459 4S8 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAFSSKSTSG'GTAAb 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V 



X D Y 



V 



V 



549 558 567 575 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



VH TFPAVIjQ 

-1 - 



Received 03-Apr-00 15:24 



From-01159 552201 



to 
if 



To-THE PATENT OFFICE 



Page 99 



03/04 '00 MON 14:56 FAX 01159 552201 



ERIC POTTER CLARK SON 



©109 




603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



V 



V 



657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



N 



N 



H K 



N 



V 



K K 



711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



V 



K 



K 



H 



■765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



K 



K 



819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTC ACC CCC TAC GAC 



V 



873 982 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 



V 



V 



H 



927 936 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



L 



N 



N 



H 



V V 



981 990 999 1008 1017 1026 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



N 



K 



V 



R 



1035 1044 1053 1062 1071 1080 

CAC CTC TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



V 



A 



S 



1039 1098 1107 1116 1125 1124 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CCC TTC ACA 



N 



R 



1143 1152 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



V 



H 



D A V 



1197 



1206 



1215 



1224 
-2- 



1233 



1242 



to- 
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GCC GAG ATC GAC GCT CTC TAT GAC GXC TAC CTG GAT GTC CAA GAC AAA TCG GGC 
A E TDAli'YDVYLDVQB KWG 



1251 1260 1269 1278 1287 1296 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 




Kliio 



D 



M 



M 



D 



N 



V 



1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTC TGG ACA AGC CCC ACC TTC CAG TGG CTG 



W 



W 



w 



1359 1368 1377 1386 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

-rwnSAnTTATFTHCAYDR 



1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA CCC ATG CTG CTC CCA CCC GCC GTT GTT CCC GAC TCG GCT CTT 



I 


V v 


A 


G M 


Lr 


Li R 


G 


A V 


V 


P D 


S 


A L 




1467 




1476 




1*185 




1494 




1503 




1512 


CCC 


TTT AAC 


TTC 


CAG GCT 


GCC 


TAT GGC 


CTG 


AGT GAC 


CAA 


CTG GCC 


CAA 


GCC ATC 


























p 


V N 


F 


Q A 


A 


r G 


1> 


S D 


Q 


i* a 


Q 


A 1 




1521 




1530 




1539 




1548 




1537 




1566 


AGT 


GAC CAC 


TAT 


CCA GTG 


GAG 


GTG ATG 


CTG 


AAG GGG 


GGC 


GGA CCC 


AAA 


AAG AAG 


S 


D H 


V 


P V 


E 


V M 


u 


K G 


\3 


G P 


K 


K K 



1575 

CGC AAG GTT TGA 3' 



K V 



-3- 
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jfy pAS106 



LOCUS PAS106.DNA 1596 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFG1 Fab 1 2 fused to human DNase I with SV40 NLS(pAS106) 
ACCESSION 
NID 

KEYWORDS 
SOURCE 



DNase I. 

DNase I sequence is from assembled oligos {thus modified c/f 



MHDNASEl.dna) 

ORGANISM Homo sapiens 



AUTHORS 
TITLE 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



Eukaryotae; mitochondrial eukaryotes; Metazoa? Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S., Capon,D.J., Hellmiss , R. , Marsters, S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9138-9192 (1990) 
91067672 

355 a 475 c 452 g 314 t 



1 AT G G GAT GG A 
61 GTGCAGCTGG 
121 * TGCAAGGCTT 
1S1 GGAAAGGGCC 

2 41 GAGAAGTTCA 
301 GAG C T C AG C A 

3 61 TTTGCCTGGT 

4 21 AAGGGCCCAT 
4 81 GCCCTGGGCT 
541 GGCGCCCTGA 

601 TCCCTCAGCA 

66± AACJC^TGAATCJ 

721 GACAAAACTC 

781 GGGCTGAAGA 

8 41 ACCCTCGTCA 
901 GTCAGAGACA 

9 61 CCAGACACCT 
1021 TACCTGTTCG 
1031 GGCTGCGAGC 
1141 TCCCGGTTCA 
1201 GCAGTAGCCG 
12 61 TTGGAGGACG 
1321 CAGTGGTCAT 
1381 GCTGACACCA 

14 41 CTGCTCCGAG 
1501 GGCCTGAGTG 

15 61 AAGGGGGGCG 



GCTGTATCAT 
TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 

GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCTG 



CCTCTTCTTG 
GGCAGAGGTG 

CTTCAGTGCC 
CGGAGAG AT T 
G AC AG T C AC T 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 

CGTGCCCTCC 
CAAC AC CAAG 

TGTGGAGTGC 



GTAGCAACAG 

AAAAAGCCTG 

TACTGGATAG 
TTACCTGGAA 
AGAGACACA? 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 

CCGGCTGTCC 

AGCAGCTTGG 
GTGGACAAGA 

CCACCGTGCC 



CTACAGGTGT 
GGGCCTCAGT 
AGTGGGTGCG 
G TAAT.AAT T C 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAG T T GAG CC 
CAGCACCTGA 



CCACTCCCAG 

GAAGGTGTCC 

CCAGGCTCCA 

TAGATACAAT 

AGCCTACATG 

ATCCTACGAC 

AGCCTCCACC 

GGGCACAGCG 

GTGGAACTCA 

AGGACTCTAC 

CTACATCTGC 
CAAATCTTGT 

AGGGAGCGGC 



TCGCAGCCTT 

GCTACATTGT 
GCCACCTGAC 
ATCACTACGT 
TGTACAGGCC 
CCTGCGGGAA 
CAGAGGTCAG 
AGATCGACGC 
TCATGTTGAT 
CCATCCGCCT 
CAGCTACACC 
GGGCCGTTGT 
ACCAACTGGC 
GACCCAAAAA 



// 



CAACATCCAG 
GCAGATCCTG 
TGCCGTGGGG 
GGTCAGTGAG 
TGACCAGGTG 

CGACACCTTC 
GGAGTTTGCC 
TCTCTATGAC 
GGGCGACTTC 
GTGGACAAGC 
CACGCACTGT 
TCCCGACTCG 
CCAAGCCATC 
GAAGCGCAAG 



ACATTTGGGG 
AGCCGCTACG 
AAGCTGCTGG 
CCACTGGGAC 
TCTGCGGTGG 
AACCGAGAGC 
ATTGTTCCCC 
GTCTACCTGG 
AATGCGGGCT 
CCCACCTTCC 
GCCTATGACA 
GCTCTTCCCT 
AGTGACCACT 
GTTTGA 



AGACCAAGAT 
ACATCGCCCT 
ACAACCTCAA 
GGAACAGCTA 
ACAGCTACTA 
CAGCCATTGT 
TGCATGCGGC 
ATGTCCAAGA 
GCAGCTATGT 
AGTGGCTGAT 
GGATCGTGGT 
TTAACTTCCA 
ATCCAGTGGA 



GTCCAATGCC 
GGT CCAGGAG 
T C AG GACGCA 
TAAGGAGCGC 
CTACGATGAT 
CAGGTTCTTC 
CCCGGGGGAC 
GAAATGGGGC 
GAGACCCTCC 
CCCCGACAGC 
TGCAGGGATG 
GGCTGCCTAT 
GGTGATGCTG 
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' File : PAS10 6.DTTA 

Range ; 1 - 1596 Mode : Normal 

Codon Table : Universal 



9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSCIILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG' TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TCG ATA. GAG 

VKVSCKASGVTFS AV 



171 ISO 189 19B 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAG AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYSTEKirKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRSS 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

D TAVYYCAR S YD FAWFAY 

337 39S £05 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA CCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKGPS 

441 450 459 463 477 436 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAFS5K5TSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCIiVKDYFPSPVTVSWNS 

549 55S 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 

- 1 - 
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GCC CCG CCC CAC GCA. CTA CCC CAC ATC CAC GCT CTC TAT GAC GTC TAC CTG GAT 






p d 


D 


A V 


A 


5 I 


D 


A L 


Y 


D V 


Y 


L D 


GTC 


1251 
CAA GAG 


AAA 


1260 
TGG GGC 


TTG 


1269 
GAG GAC 


GTC 


1278 


ATC 


1287 
GGC GAC 


TTC 


1296 
AAT GCG 


V 


Q E 


K 


W G 


li 


E U 


V 


M U 


M 


G D 


F 


N A 


GGC 


1305 
TGC AGC 


TAT 


1314 
GTG AGA 


CCC 


1323 
TCC CAG 


TGG 


1332 


ax i« 


1341 


TGG 


1350 
ACA AGC 


G 


C s 


Y 


V R 


P 


S Q 


W 


s s 


I 


R L 


W 


T S 


■ 

CCC 


1359 
ACC TTC 


CAG 


1368 
TGG CTG 


ATC 


1377 
CCC GAC 




1386 

rvr far** 
tit- 1 




1395 


ACA 


1404 
CCC ACG 


p 


T F 


Q 


W L 


I 


P 1 D 


S 


A D 


T 


T A 


T 


P T 


CAC 


1413 
TGT GCC 


TAT 


1422 
GAC AGG 


ATC 


1431 

GTG GTT 




1440 


TTG 

V* -L VT 


1449 
CTC CGA 


GGG 


1458 
GCC GTT 


H 


C A 


Y 


D R 


I 


V V 


A 


G M 




I* R 


G 


A V 


GTT 


1467 
CCC GAC 


TCG 


1A7 6 
GCT CTT 


CCC 


1485 
TTT AAC 


TTC 


1494 
CAG GCT 


GCC 


1503 
TAT GGC 


CTG 


1512 
AGT GAC 



VFDSALPFWFQAAYGX.SD 

152 1 1530 1539 1549 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG 

OLA0AI 5DHYPVEVMLKG 



1575 1584 1593 

GGC GGA CCC AAA AAG AAG CGC AAG GTT TGA 3' 



GGPKKKRK 




fa 
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603 612 621 630 *39 648 

CTC TAG TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSL GTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



N 



N H K 



N 



K 



747 



K K 



756 



711 720 729 73S 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



V 



K 



K 



V 



765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG 



N 



B19 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC ACC TAC ATT GTG CAG 



K H 



N 



V 



873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 

ILSRYDIALVQBVRD5. HL 

927 936 945 95<L S63 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



V 



X 



N 



N 



D 



Y ■ 



981 990 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 



H 



V V 



K 



1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



V 



D 



Q V 



V 



1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGC AAC CAC ACC TTC AAC CGA CAC CCA CCC ATT GTC AGG 



G N 



N 



A 



V 



1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



FFSRFTEVREFAI 

1197 1206 1215 1224 

-2- 



F L H A 
1233 1242 
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ft) PAS107 



LOCUS 



PAS107.DNA 1590 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFG1 Fab' 2 fused to human DNase I with SV40 NLS(pAS107) 
ACCESSION 
NID 

KEYWORDS 
SOURCE 



DNase I- t 

DNase I sequence is from assembled oligos {thus modified c/f 



MHDNASEl . dna) 

ORGANISM Homo sapiens 



AUTHORS 
TITLE 

JOURNAL 
MEDLINE 

BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 

541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
. 1381 
1441 
1501 

1561 

// 



Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates? Catarrhini; Hominidae; Homo. 
Shale, S., Capon, D.J. , Hellmiss,R., Marstera, S -A« and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl, Acad. Sci. U.S.A. 37 (23), 9189-9192 (1990) 
91067672 

354 a 474 c 448 g 314 t 



ATGGGATGGA 
GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
SAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
AAGATCGCAG 

GTCAGCTACA 
GACAGCCACC 
ACCTATCACT 
TTCGTGTACA 
GAGCCCTGCG 
TTCACAGAGG 
GCCGAGATCG 
G AC GT CAT GT 
TCATCCATCC 
ACCACAGCTA 
CGAGGGGCCG 
AGTGACCAAC 
GGCGGACCCA 



GCTGTATCAT 
TGCAGTCTGG 
CT G G C T AC AC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCTG 



CCTCTTCTTG 
GGCAGAGGTG 
CTTCAGTGCC 
GG GAG AG ATT 
GACAGTCACT 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GG AC T AC T T C 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
TGTGGAGTGC 



GTAGCAACAG 

AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
CCACCGTGCC 



CTACAGGTGT 

GGGCCTCAGT 

AGTGGGTGCG, 

GTAATAATTC 

CCACAAACAC 

ACTGTGCAAG 

CAGTCTCCTC 

GCACCTCTGG 

TGACGGTGTC 

TACAGTCCTC 

GCACCCAGAC 

AAGTTGAGCC 

CAGCACCTGA 



CCACTCCCAG 
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
AT C C T AC G AC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
AGGCGGGCTG 



CCTTCAACAT 
TTGTGCAGAT 
TGACTGCCGT 
ACGTGGTCAG 
GGCCTGACCA 
GGAACGACAC 
TCAGGGAGTT 
ACGCTCTCTA 
TGATGGGCGA 
GCCTGTGGAC 
CACCCACGCA 
TTGTTCCCGA 
TGGCCCAAGC 
AAAAGAAGCG 



CCAGACATTT 
CCTGAGCCGC 
GGGGAAGCTG 
TGAGCCACTG 
GGTGTCTGCG 
CTTCAACCGA 
TGCCATTGTT 
TGACGTCTAC 
CTTCAATGCG 
AAGCCCCACC 
CTGTGCCTAT 
CTCGGCTCTT 
CAT CAGT G AC 

CAAGGTTTGA 



GGGGAGACCA 
TACGACATCG 
CTGGACAACC 
GGACGGAACA 
GTGGACAGCT 
GAGCCAGCCA 
CCCCTGCATG 
CTGGATGTCC 
GGCTGCAGCT 
TTCCAGTGGC 
GACAGGATCG 
CCCTTTAACT 
CACTATCCAG 



AGATGTCCAA 
CCCTGGTCCA 
TCAATCAGGA 
GCTATAAGGA 
ACTACTACGA 
TTGTCAGGTT 
CGGCCCCGGG 
AAGAGAAATG 
ATGTGAGACC 
TGATCCCCGA 
TGGTTGCAGG 
TCCAGGCTGC 
TGGAGGTGAT 



TGCCACCCTC 
GGAGGTCAGA 
CGCACCAGAC 
GCGCTACCTG 
TGATGGCTGC 
CTTCTCCCGG 
GGACGCAGTA 
GGGCTTGGAG 
CTCCCAGTGG 
CAGCGCTGAC 
GATGCTGCTC 
CTATGGCCTG 
GCTGAAGGGG 
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Mode : Normal 



45 



54 



9 18 27 36 

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



M 



W 



V H 



S3 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



V K K 



XX 7 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GCC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



©116 



K 



K 



W 



171 180 189 196 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



W V 



K 



W V 



2 25 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



G 



N N 



N 



K 



V 



V 



279 288 297 306 315 324 

AGA GAC ACA ' TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



N 



M 



R 



333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



V 



w 



3fi7 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA CCC TCC ACC AAG GGC CCA TCG 



W 



T ' V 



441 450 459 463 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



V 



K 



495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



V K 



V 



V 



W N 



549 553 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



V 



F P A V L Q 

- 1 - 



7L 
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603 612 621 €30 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

XjYSLSSVVTVPSS-SIjGTQ- 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYIC-NVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 

VEPKSC'DKTHTCCVECPP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 

C P A P E G G I/KIAAFNI QTF 

819 828 837 846 855 864 

GGG GAG ACC AAG ATG TCC AAT CCC ACC CTC GTC AGC TAC ATT CTC CAG ATC CTG 

G.ETKMSNATLVSYIVQIL 

873 882 891 900 909 919 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 

S R Y D I A L V Q E V R .D S H 3j T A 

927 936 945 9541 963 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 

VGKLLD NLNQD AP DTYH Y 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 

VVSSPLGRNSYKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC 

YRPDQV'SAVDSYYYDDGC 

1089 1098 1107 ' 1116 1125 1134 

CAC CCC TCC CGG AAC GAC ACC TTC AAC CGA GAG CCA CCC ATT CTC AGG TTC TTC 

EPCGNDTFttREPAIVRFF 

1143 1152 1161 1170 1179 1168 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG 

SR.FTEVREFA IVPLrHAA p 

1197 1206 1215 1224 1233 1242 

-2- 
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f\ a GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAG CTG GAT CTC CAA 

' GDAVAEI DALYDVYLDVQ 

12S1 1260 1269 1278 1267 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 



K W 



V M 



M 



N 



1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 



V 



Q - W 



1359 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 



a 118 



w 



H 



.1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGC ATC CTC GTT GCA GGG ATG CTC CTC CGA GGG GCC GTT GTT CCC 



V 



A 



M 



A 



1467 147Q 1435 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQA. AYGLS D Q I, 

1521 1530 1539 154B 1557 1566 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA 



H 



V 



V M 



K 



1575 15B4 
CCC AAA AAG AAG CGC AAG GTT TGA 3' 

PKKKRKV* 



-3- 



'fa 
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